PREFACE



In March of 2004, I had the pleasure of giving eight lectures as part of the

Pathways Lecture Series in Mathematics, Keio 
which is supported by 

The Twenty-first Century COE Program at Keio 
Integrative Mathematical Sciences: 
Progress in Mathematics Motivated by
Natural and Social Phenomena 

As explained in Lecture 1, these lectures cover material that I had just finished 
writing, and which I hope will constitute the first part of a book on Mechanics for 
Mathematicians. But I tried to present them as course lectures in development, 
rather than as a finished product, and I benefited immensely from this, since 
such a presentation, and the reactions of the listeners, raised all sorts of questions 
that I hadn't thought of, and which will eventually result in a rather thorough 
rewriting of the original material. 

In these notes, I have tried to preserve the informal nature of these lectures, 
aiming for spontaneity, rather than the more careful arrangement of material
one expects to find in a book, so that they will complement the book (if it 
ever appears) rather than merely serving as a repetition; however, some parts, 
especially the notes for the later lectures, are taken rather directly from parts of 
the book already written. Though the notes do not reproduce the material of 
the lectures in exactly the same order as they were given, they do cover basically
the same material. 

I am grateful to Prof. Yoshiaki Maeda for inviting me to give these lectures, 
and to Prof. Martin Guest of Tokyo Metropolitan University, where two of these 
lectures were also given. I greatly enjoyed their hospitality and the experience 
of living in Tokyo. 
THE HARDEST
PART OF MECHANICS 
(THE FUNDAMENTALS) 

These lectures are based on a book that I am writing, or at least trying to 
write. For many years I have been saying that I would like to write a book (or 
series of books) called Physics for Mathematicians. Whenever I would tell people 
that, they would say, Oh good, you're going to explain quantum mechanics, or 
string theory, or something like that. And I would say, Well that would be nice, 
but I can't begin to do that now; first I have to learn elementary physics, so the 
first thing I will be writing will be Mechanics for Mathematicians. 

So then people would say, Ah, so you're going to be writing about symplectic 
structures, or something of that sort. And I would have to say, No, I'm not 
trying to write a book about mathematics for mathematicians, I'm trying to write 
a book about physics for mathematicians; of course, symplectic structures will 
eventually make an appearance, but the problem is that I could easily under- 
stand symplectic structures, it's elementary mechanics that I don't understand. 

Then people would look at me a little strangely, so I'd better explain what I 
mean. When I say that I don't understand elementary mechanics, I mean, for 
example, that I don't understand this: 



Of course, everyone knows about levers. They are so familiar that most of us 
have forgotten how wonderful a lever is, how great a surprise it was when we 
first saw a small body balancing a much bigger one. Most of us also know the 
law of the lever, but this law is simply a quantitative statement of exactly how 
amazing the lever is, and doesn't give us a clue as to why it is true, how such a 
small force at one end can exert such a great force at the other. 

Now physicists all agree that Newton's Three Laws are the basis from which 
all of mechanics follows, but if you ask for an explanation of the lever in terms
of these three laws, you will almost certainly not get a satisfactory answer. You 
might be told something about conservation of angular momentum, or perhaps 
even the principle of virtual work, but I can almost guarantee that as soon as 
someone starts to give an answer you can almost certainly interrupt them and
explain why their answer can't be correct. For, the correct answer must begin 
by saying "First we have to understand rigid bodies", since, after all, the rigidity 
of the lever is an absolute necessity for it to work, and if one hasn't already 
analyzed rigid bodies, then one simply isn't in a position to give an explanation 
of the lever. 

Well, we won't get to rigid bodies for a few lectures yet, so it'll be a while 
before we can give a good answer to this question. 

In these lectures I am basically going to cover (portions of) the part of the 
book that has been written so far. The final book will contain 4 parts: 

I. Fundamentals 

II. Applications 

III. Lagrangian Mechanics 

IV. Hamiltonian Mechanics,

with perhaps a fifth part,

V. Abstractions to Lie groups.

I hope to finish this book in about a year. So far, I have written just Part I, 
and it has taken me nearly a year and a half,¹ but that doesn't mean that my
hope is necessarily unrealistic. After all, Part I is the hard part, all about the 
basic physical ideas, while the remaining parts are basically mathematics. 

We are going to be considering the foundations of mechanics by starting right 
at the source, Newton's Philosophiae Naturalis Principia Mathematica or Mathematical
Principles of Natural Philosophy, in English, or simply The Principia. 

Now this book is one of the great classics, probably the greatest book in all of 
physics, but that doesn't mean that someone should try learning physics from
it! Like many a classic, it is basically unreadable. To begin with, it's in Latin, 
and I don't even think there is a Japanese translation. Fortunately, there is a 
very good recent English translation 

Newton, The Principia. Translated by I. Bernard Cohen and Anne Whitman.

and nearly half of this hefty tome is a guide to reading the actual translation; 
see [N-C-W] in the References at the end of these notes for further details. Any 
quotations from the Principia are taken from this book. 

Newton obviously wrote the Principia with Euclid's Elements in mind. In 
fact, after the usual beginning stuff, the title page, various prefaces for various 
editions, both by Newton and his editor, and a long ode to Newton written by 
Halley (which I haven't read, but have grave doubts of being very good), the 
very first words of the Principia proper are 

¹ Actually about 2 years, when you count the rewriting that I felt was necessary after
giving these lectures. 






LECTURE 1 



DEFINITIONS 



Definition 1 Quantity of matter is a measure of matter that arises from its density and 
volume jointly. 



Now of course anyone can see that this definition isn't very useful, in fact 
obviously circular, since density is usually defined as mass per unit volume. In 
[N-C-W], and elsewhere, you can find long discussions about this, but we don't 
really need to be concerned. Newton is basically just trying to get started, and 
he will tie things together better as he goes along. In fact, this definition is 
immediately followed by further discussion: 



Definition 1 Quantity of matter is a measure of matter that arises from its density and 
volume jointly. 

If the density of air is doubled in a space that is also doubled, 
there is four times as much air, and there is six times as much if 
the space is tripled. The case is the same for snow and powders 
condensed by compression or liquefaction, .... Furthermore, I 
mean this quantity whenever I use the term "body" or "mass" in 




The three dots here indicate omissions that aren't really very important (you'll 
have to take my word for this), but there's also a portion that has been temporar- 
ily shaded out. This part, by contrast, is very important, but it's very confusing 
that Newton put it right here, and we'll come back to it later! 

Naturally, these additional remarks don't clarify the concept very much, but 
at least they help a little, and in particular Newton has now introduced the 
important term "mass" instead of the awkward phrase "quantity of matter". It's 
something of a distraction that he also uses the word "body", because nowadays 
we think of a body as some object, that has a particular mass, so we won't follow 
Newton's usage at all, but will always use the word "mass", whatever in the world 
that is eventually going to mean. 

The next definition is a lot simpler, 
Definition 2 Quantity of motion is a measure of motion that arises from the velocity and the
quantity of matter jointly. 



and we merely want to point out that nowadays this is what we call "momen- 
tum". 

The third definition, though just as vague as the others, finally gives us some 
idea of what "mass" is really supposed to mean. 



Definition 3 Inherent force of matter is the power of resisting by which every body, so far as 
it is able, perseveres in its state either of resting or of moving uniformly straight 
forward. 

This force is always proportional to the body [i.e., mass] and 
does not differ in any way from the inertia of the mass except in 
the manner in which it is conceived. . . .



This is certainly a strange way of speaking — nowadays, we don't speak of the 
"force of matter" — but, in short, what Newton is saying is that "mass" is basically 
what we call "inertia", a measure of how hard it is to get something moving. If 
I were teaching an introductory mechanics course for physics students I would 
provide two balls like these [two balls made from styrofoam semi-spheres, iden- 
tical in appearance], give the first a slight push, so that it would start rolling 
rapidly, and then ask a brawny-looking student to repeat this with the second. 
[The second ball was filled with heavy chain, and barely moved when pushed 
a great deal harder.] This little experiment shows that the second ball has a 
much greater mass than the first (although the student, if asked, would proba- 
bly incorrectly conclude that the second ball weighed a lot more than the first, 
which is something quite different, that we'll get to in a bit). 

Of course, all of this is merely vaguely descriptive, and the remainder of 
this first section of the Principia, in which Newton adds a few other definitions,
doesn't help much. 

So it's time to turn to the second section, the axioms: 



AXIOMS, OR THE LAWS OF MOTION 



Law 1 Every body perseveres in its state of being at rest or of moving uniformly straight 
forward, except insofar as it is compelled to change its state by forces impressed. 
Of course, Newton didn't call this Newton's First Law, and, as we will see
later on, he explicitly credits the law to Galileo. Galileo didn't exactly demon- 
strate this law, but instead explained why everyday experiences might seem to 
contradict it. Not too long ago the law might be illustrated dramatically by 
sliding an object along a glass table with dry ice evaporating from it, forming a 
cushion of gas that practically eliminates friction. Recently the invention of the
"air-trough" (see reference [N-L]) has provided a nice way to illustrate Newton's 
first law in the classroom: 




One gives the block just the tiniest nudge, and watches it glide at constant 
velocity to the end of the track. 

In an elementary physics course, considerable discussion about coordinate 
systems might be required here, but we'll simply point out that we are basically 
assuming that there are certain coordinate systems ("inertial systems") in which 
this law holds, and that any system moving at uniform velocity with respect to 
an inertial system obviously has the same property. So we pass on immediately
to 



Law 2 A change in motion is proportional to the motive force impressed and takes place along 
the straight line in which that force is impressed. 

If some force generates any motion, twice the force will generate twice
the motion, .... [The remainder of this passage is shaded out.]





Once again, we have that obnoxious shaded region, which we'll unveil at the 
appropriate time. 

Of course, nowadays we simply state the second law as 



F = ma, 
where a is the acceleration of our body. Newton speaks in terms of (discrete)
"changes in motion" because he often thinks in terms of what we would call 
"instantaneous" forces, exerted only for a very short time. 

This law might seem to be virtually meaningless, since we haven't said how 
to measure mass m, nor how to measure force F. But there's really more to 
it than appears, as we will see by skipping to the discussion ("Scholium") that 
Newton gives at the end of this section of the Principia, where he begins by ac- 
knowledging Galileo (again, some material has been shaded out, to be unveiled 
at a later date): 



Scholium The principles I have set forth are accepted by mathematicians and
confirmed by experiments of many kinds. By means of the first two laws
[shaded out] Galileo found that the descent of heavy bodies is in the squared
ratio of the time [shaded out] as experiment confirms, except insofar
as these motions are somewhat retarded by the resistance of the air.



In the first and second editions of the Principia, this is all that Newton says, 
and it might seem rather mysterious — what does squared ratio of the time have 
to do with the second law? As we will see once again in a future lecture, 
Newton's exposition often suffered from a defect common to many very bright 
people — he often didn't realize that what was obvious to him might not be 
obvious to others — and it was only in the third edition that he added a more 
detailed explanation: 



Scholium The principles I have set forth are accepted by mathematicians and
confirmed by experiments of many kinds. By means of the first two laws
[shaded out] Galileo found that the descent of heavy bodies is in the squared
ratio of the time [shaded out] as experiment confirms, except insofar
as these motions are somewhat retarded by the resistance of the air.
When a body falls, uniform gravity, by acting equally in individual
equal particles of time, impresses equal forces upon that body and
generates equal velocities; and in the total time it . . . generates a
total velocity proportional to the time. And the spaces described in
proportional times are as the velocities and the times jointly, that is,
in the squared ratio of the times.



Well, this is the sort of explanation that makes you glad that you aren't trying 
to learn mathematics in the 17th century! What Newton is saying is the following:
Suppose our body starts at rest. The force of gravity, which we think of
as an instantaneous force acting at time 0, changes the downward velocity by
a small increment v. So after a small increment of time t the body has fallen
by an amount vt. Then its velocity receives an additional increment v, giving
it velocity 2v, so after the next increment t of time it falls by the amount 2vt.
In the next increment of time it falls by the amount 3vt, then by the amount
4vt, etc. So at the end of a large number N of such increments it has fallen

$(1 + 2 + 3 + \cdots + N)$ times $vt$, which is close to $N^2 vt/2$, or to $(v/t)\,T^2/2$, where $T = Nt$

is the total time it has fallen and $v/t$ plays the role of the acceleration.
Nowadays, of course, we just say, If $s'' = a$ for a
constant $a$, then $s' = at$, and thus $s = \tfrac{1}{2}at^2$.
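
Newton's increment-by-increment argument is easy to check numerically. The following is only an illustrative sketch, not anything taken from the Principia: the function names `discrete_fall` and `exact_fall` and the sample numbers are assumptions made here. It delivers N equal velocity increments, adds up the distances covered, and compares the total with the modern formula.

```python
def discrete_fall(a, T, N):
    """Distance fallen when gravity acts as N equal impulses.

    a is the acceleration, T the total time, N the number of increments.
    Each impulse adds velocity v = a*t, where t = T/N is the time step,
    and the body then moves at constant velocity for one step.
    """
    t = T / N          # length of one small increment of time
    v = a * t          # velocity increment delivered by each impulse
    distance = 0.0
    velocity = 0.0
    for _ in range(N):
        velocity += v              # impulse at the start of the step
        distance += velocity * t   # uniform motion during the step
    return distance


def exact_fall(a, T):
    """The modern answer s = a*T**2/2."""
    return 0.5 * a * T**2


if __name__ == "__main__":
    a, T = 9.8, 2.0   # sample values, chosen only for illustration
    for N in (10, 100, 1000, 10000):
        print(N, discrete_fall(a, T, N), exact_fall(a, T))
```

The discrete total is (1 + 2 + ... + N) times vt, which exceeds the exact value by the factor 1 + 1/N, so the two agree more and more closely as N grows.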

Even after all this explanation, you might wonder what this has to do with 
the second law, since it seems to be a purely mathematical result about second 
derivatives. So it's important to go back and look at one particular phrase in 
Newton's argument: 

When a body falls, uniform gravity, by acting . . . 

What does Newton mean here by "uniform gravity"? You might think he means 
that gravity produces the same acceleration on the body no matter what velocity 
it already has (and this is indeed one of the things he is assuming), but Newton 
is also appealing (implicitly, it is true) to an important fact that we can test 
experimentally. 

To perform this experiment, we are going to use the following apparatus. 



Now as soon as you see this large spring, you might be thinking: oh, we're 
going to invoke Hooke's law. But we aren't going to be using anything of the 
sort; notice that there's no scale here behind the spring, only a blank piece of wood.
We only care about one thing: experience shows us that it takes a greater force
to stretch a spring by a large amount than is required to stretch it by a small
amount — we are completely uninterested in the particular law involved (which 
we could investigate later, if we ever figured out how to measure force). 

For the first part of the experiment, we attach a block to our spring and use 
this nice felt pen to mark on the wood how far down the spring has been pulled. 
Of course, this isn't a high-tech experiment, it's a Tokyu Hands experiment.
But we could easily imagine a much more refined experiment, with a very strong 
spring, very accurate ways of measuring its displacement, etc. 

For the sake of time, I'll simply describe the second part of the experiment, 
instead of having us actually perform it. For this part of the experiment, we 
would all go upstairs to the room above this one. Perhaps there's someone
there now, well, we'll just ask them if we can take a moment's time to perform
a simple experiment. And this simple experiment is a repeat of the experiment
that we have just done here: we attach the block to our spring, and see how far down
the spring has been pulled. It appears that the spring has been pulled down 
exactly the same amount. (At least within the accuracy of our Tokyu Hands 
experiment.) 

The next part of the experiment is one that probably no one except a tourist 
like myself might be willing to perform, taking the same measurement at the 
top of Tokyo Tower. Once again, it seems that the spring is stretched by exactly 
the same amount. 

Now this is what Newton means when he speaks of "uniform gravity": a force 
that is the same no matter how high up we go (of course, that's not really true 
for the force of gravity, but it's true to a very good approximation for the sort 
of distances above the earth's surface that we are concerned with). The point 
is that we can at least say when two forces are equal without having to specify a 
way of measuring them. 

Thus, the experimental evidence that the force of gravity is equal at all rea- 
sonable distances from the earth's surface is consistent with the equality of 
acceleration along the path of a falling object, which is essentially equivalent to 
Galileo's observation that "the descent of heavy bodies is in the squared ratio 
of the time". 

It may seem a long way from "equal forces produce equal accelerations" to
"force equals mass times acceleration", but that's mainly because we still haven't
given an "operational definition" of mass, describing how it is to be measured. The
important point is that the results of our little experiment actually suggest a way 
of producing such an operational definition. 

We'll begin with a definition that is conceptually very straightforward, al- 
though it would be rather awkward to use in practice. 

First we want to have a very long air-trough, with a carriage, of negligible 
mass, in which we can place a body whose mass we want to measure. Parallel 
to this air-trough we have a track with a little cart that can be pulled along the 
track with any desired acceleration a; for simplicity let's imagine that we merely 
have to turn a knob on an instrument panel to vary a, without worrying about 
the clever mechanism that would be required to produce this effect. Of course,
our track, and the air-trough, will also have to be very long if we expect to pull
the cart with constant acceleration a for any reasonable amount of time.

Tokyu Hands is a large do-it-yourself and hobby store where all the paraphernalia for
this lecture was purchased.

The cart is provided with an extending arm that can be placed behind the 
carriage on the air-trough so that the carriage is moved in tandem with the cart. 
But instead of placing this arm directly behind the carriage, we will put a nice 
strong spring between them. 



[Figure: top view of the apparatus, showing the cart, its extending arm, the spring, and the carriage.]



Now let's choose a particular body $B_0$ that we want to be our "unit mass",
so that we will assign it mass m = 1. We place this body in the carriage and
pull our cart with some convenient constant acceleration $a_0$. Initially, of course,
the carriage will not move with the same acceleration, because the spring will
compress somewhat, so that the carriage won't move exactly in tandem with
the cart. But we very quickly reach the point where the spring is no longer
compressing, or at any rate the length of the spring is constant within the limits
of accuracy of our measurements. We carefully measure this final length, and
call it $L_0$.

Note, by the way, that this whole set-up is dependent on our original experimental 
observation that equal forces produce equal acceleration: the compression of the 
spring measures the force that is being applied to the carriage, and once the 
carriage is moving with a constant acceleration, the force applied to it must be 
constant. 

Now let's take some other body B whose mass m we want to determine.
We place B in the carriage instead of $B_0$ and once again pull our cart with
constant acceleration $a_0$, and observe the (final) length of the compressed spring.
It probably isn't $L_0$ any more, so we try adjusting the acceleration a in order
to make it become $L_0$: if the spring was compressed more, to a length $< L_0$,
we try an acceleration $< a_0$, if it was compressed to a length $> L_0$ we try an
acceleration $> a_0$. After lots of trial and error, we finally find an acceleration
$a_1$ which compresses the spring to exactly the length $L_0$. We now define

mass m of B $= \dfrac{a_0}{a_1}$.

This definition makes the law F = ma work for any particular fixed F, and
much experimentation would show that it works just as well for any other F; in
other words, if we repeated this whole process using a different spring, and thus
a different $L_0$, we would still end up assigning the same masses to all bodies.
Then, of course, we can use the equation in reverse, as a way of measuring force,
by seeing what acceleration is produced on a body of some known mass m.
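
The trial-and-error procedure just described can be imitated in a few lines. This is a minimal sketch under loudly stated assumptions: the spring's response `spring_length` is an arbitrary made-up function (deliberately not Hooke's law), the "true" masses stand in for nature and are never read by the experimenter, and all function names are hypothetical. The only property used is the one the experiment relies on: a greater force compresses the spring to a shorter length.

```python
def spring_length(force):
    """Hypothetical spring: length shrinks as the compressing force grows.

    The particular formula is an arbitrary assumption (deliberately not
    Hooke's law); the procedure only needs it to be strictly decreasing.
    """
    return 1.0 / (1.0 + force ** 1.5)


def steady_length(true_mass, acceleration):
    """Final spring length when a body is pulled at constant acceleration.

    'true_mass' stands in for nature: the experimenter never reads it.
    """
    return spring_length(true_mass * acceleration)


def find_matching_acceleration(true_mass, target_length, lo=1e-6, hi=1e6):
    """Trial and error (bisection) for the acceleration giving target_length."""
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if steady_length(true_mass, mid) < target_length:
            hi = mid   # spring compressed too much: try a smaller acceleration
        else:
            lo = mid   # spring compressed too little: try a larger acceleration
    return 0.5 * (lo + hi)


def measured_mass(true_mass, unit_true_mass=1.0, a0=2.0):
    """Operational mass a0/a1 relative to the unit body."""
    L0 = steady_length(unit_true_mass, a0)      # calibrate with the unit body
    a1 = find_matching_acceleration(true_mass, L0)
    return a0 / a1


if __name__ == "__main__":
    for hidden in (0.25, 1.0, 3.0):
        print(hidden, measured_mass(hidden))
```

Changing the formula inside `spring_length`, or the calibration acceleration `a0`, changes the reference length but not the masses that come out, which is the point of the remark about repeating the process with a different spring.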

Before proceeding further, two points should be made. First, our original little 
experiment is certainly consistent with F = ma, but it would hardly seem to be 
very conclusive. After all, how do we know that the correct law isn't something 
like F = ma + ka' for some constant k, so that third derivatives, or even higher 
derivatives, are involved? I don't know of any experiments to directly test this, 
but of course there is an enormous body of experience that attests to it: the 
force of gravity isn't constant over large distances, so all the calculations that 
keep satellites in motion, guide space ships to the moon and land them, etc., 
present a great deal of evidence. 

A second point to ponder is that we always take for granted that mass is
"additive": if we take bodies of mass $m_1$ and $m_2$ and join them together (for
example, by placing them together in the carriage on our air-trough), then the
new object should have mass $m_1 + m_2$. In terms of our operational definition,
this hardly seems clear: it says that if $a_1$ is the acceleration that the first body
must be subjected to in order to compress the spring to length $L_0$, while $a_2$ is
the acceleration that the second body must be subjected to in order to compress
the spring the same amount, then to obtain the same compression for the two
objects together, they must be subjected to an acceleration $a$ satisfying

$\dfrac{1}{a} = \dfrac{1}{a_1} + \dfrac{1}{a_2}$.

At first glance, this might seem to be a strange
fact that could only be verified by experiments with large numbers of varying
masses, so it is interesting to try to figure out what reasonable body of basic
experimental facts would lead one to the additivity of mass in a more reasonable
manner, an exercise you might like to think about before the next lecture.
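
For the record, here is how additivity turns into a relation between accelerations. This is just the operational definition unwound, writing $a_0$ for the acceleration used with the unit mass and $a_1$, $a_2$, $a$ for the accelerations that bring the spring to the reference length for the first body, the second body, and the two together; the sample numbers at the end are purely illustrative.

```latex
\begin{align*}
  m_1 = \frac{a_0}{a_1}, \qquad m_2 = \frac{a_0}{a_2}, \qquad
  m_{\text{both}} = \frac{a_0}{a}
  && \text{(operational definitions)} \\[4pt]
  m_{\text{both}} = m_1 + m_2
  \;\Longleftrightarrow\;
  \frac{a_0}{a} = \frac{a_0}{a_1} + \frac{a_0}{a_2}
  \;\Longleftrightarrow\;
  \frac{1}{a} = \frac{1}{a_1} + \frac{1}{a_2}
  && \text{(divide by } a_0\text{)} \\[4pt]
  a_1 = 2,\; a_2 = 3
  \;\Longrightarrow\;
  \frac{1}{a} = \frac{1}{2} + \frac{1}{3} = \frac{5}{6},
  \quad a = \frac{6}{5}
  && \text{(illustrative numbers)}
\end{align*}
```

Note that $a_0$ drops out, so the relation does not depend on which acceleration was used for calibration.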

Finally, here is a less direct, but more convenient operational definition of 
mass, based on the mathematician's and physicist's common view that a straight 
line is just a circle of infinite radius. 

Instead of using an air-trough, we simply attach our body B to the end of
a very stiff spring that is being rotated horizontally with some large constant
"angular frequency" $\alpha$, so that B moves along the circle

$c(t) = R(\cos \alpha t, \sin \alpha t)$

for some radius R. This radius R will be somewhat larger than the unstretched
length of the spring, because B actually begins moving along a spiral, pulling
the spring out, though its path soon becomes indistinguishable from a circle.
For the acceleration we simply have

(A) $c''(t) = -R\alpha^2(\cos \alpha t, \sin \alpha t)$,

so that the acceleration always points directly inward, and has magnitude $R\alpha^2$.
This means that the force F that the spring exerts on B also always points
directly inward and has constant magnitude.

Declaring B to be our unit of mass amounts to saying that

(a) $|F| = 1 \cdot R\alpha^2$.

To determine the mass m of any other body, we attach it to the end of our
spring and vary the angular frequency with which we rotate it until we arrive
at an angular frequency $\beta$ for which our body is moving along a circle

$c(t) = R(\cos \beta t, \sin \beta t)$

of the same radius R. Now we should have

(b) $|F| = m \cdot R\beta^2$,

with the |F| in equations (a) and (b) having the same value, since in both cases
the spring has been stretched by the same amount. In other words, we can
determine m by

$m = \alpha^2/\beta^2$.

We've ignored the effect of gravity on these bodies, but that would become
negligible in comparison to the force of our stiff spring when $\alpha$ is large, or we
might imagine the measurements being made in outer space.
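
Both ingredients of this definition can be checked numerically. The sketch below is an illustration only, with hypothetical function names and sample values: it estimates the acceleration of uniform circular motion by finite differences, confirms that it points inward with magnitude equal to the radius times the squared angular frequency, and then computes a mass from two angular frequencies.

```python
import math


def position(R, omega, t):
    """Point on the circle c(t) = R(cos(omega t), sin(omega t))."""
    return (R * math.cos(omega * t), R * math.sin(omega * t))


def acceleration(R, omega, t, h=1e-4):
    """Central-difference estimate of c''(t)."""
    x0, y0 = position(R, omega, t - h)
    x1, y1 = position(R, omega, t)
    x2, y2 = position(R, omega, t + h)
    return ((x0 - 2 * x1 + x2) / h**2, (y0 - 2 * y1 + y2) / h**2)


def mass_from_frequencies(alpha, beta):
    """Mass relative to the unit body: m = alpha**2 / beta**2."""
    return alpha**2 / beta**2


if __name__ == "__main__":
    R, alpha, t = 0.5, 3.0, 0.7     # sample values for illustration only
    ax, ay = acceleration(R, alpha, t)
    print(math.hypot(ax, ay), R * alpha**2)   # the two should nearly agree
    print(mass_from_frequencies(alpha=3.0, beta=1.5))
```

A body that must be whirled at half the angular frequency of the unit body to stretch the spring to the same radius is assigned four times the mass.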

I'm sure that the basic mechanism for this definition could be greatly refined. 
Instead of a spring, one might whirl a tube filled with mercury, and measure 
the compression of this mercury column, etc. But I don't think anyone has
ever actually produced a mechanism of this sort. In fact, as far as I know, no 
one has ever measured the mass of anything accurately. This statement obviously 
requires a bit of explanation! 

Let's consider these two balls, which we used previously in a little "experi- 
ment" to illustrate the concept of mass, and again enlist the aid of the student 
who performed the experiment, and quite possibly stated, incorrectly, that the 
experiment showed that the second ball had a greater weight than the first 
(rather than a greater mass). We could then continue the experiment by taking 
each of the balls off the table, in turn, and having the student hold it. Almost 
no effort would be required to hold up the first ball [as you can see when I 
hold it up], while considerably more effort would be required for the second 
ball [as you can now see me straining a bit to hold up the second ball]. This 
second experiment shows that the weight of the second ball — a measure of the 
gravitational force exerted on it by the earth — is much greater than the weight 
of the first. 

In the first experiment, the effect of this gravitational force was irrelevant 
(except insofar as it affected friction), because the table was counteracting this 
force, so we were simply comparing the mass of the two bodies, while the second
experiment directly compares the weight of the two bodies. So one of the
first things that every beginning physics student needs to learn is the difference 
between mass and weight. For modern students, it's probably especially easy to 
illustrate this difference by considering what happens in a freely moving
spacecraft where everything has no weight at all, so that these two balls would simply
float wherever they were placed, but where moving one of the balls would be a
lot more difficult than moving the other. 

On the other hand, once we've clarified this distinction, we have to confuse 
the students once again, by pointing out that the relative masses of two bodies 
seem to be the same as their relative weights; everyday experience would
certainly have led us to conclude, without even thinking about it, that the second
ball, so much harder to set in motion, would also be much harder to lift. 

In terms of the second law, we can make a much more specific correlation: 
Since the weight of an object, of mass m, say, is the force F of gravity on it, 
the law F = m • v' means that the ratio F/m of weight to mass is simply the 
acceleration that an object undergoes under free fall. Thus, proportionality of 
weight to mass is equivalent to the assertion that all bodies fall with the same 
acceleration, the famous fact usually attributed to Galileo. 
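
Spelled out, the equivalence is a one-line computation. The symbol $W$ for the weight (the magnitude of the gravitational force) and the constant $g$ are notation introduced here for convenience.

```latex
\[
  W = |F| = m\,v'
  \quad\Longrightarrow\quad
  v' = \frac{W}{m},
\]
\[
  W = g\,m \ \text{for every body, with one fixed constant } g
  \quad\Longleftrightarrow\quad
  v' = g \ \text{for every body.}
\]
```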

Aristotle, as we all know, had claimed that large bodies fall faster than smaller 
ones, and people apparently believed this for many centuries afterward, but 
it never made much sense. Even before Galileo's experiment, a man named 
J.-B. Benedetti (1530-1590) had pointed out how absurd this would be (see [M], 
Chapter II, section I, 2). After all, suppose we drop a brick like this one [luckily, 
it was a fake brick, so it didn't hurt the floor], and then take two of these bricks, 
tape them together, and drop them side by side: why in the world would the 
two bricks fall faster than each individual brick? But Benedetti still thought 
that denser bodies would fall more rapidly than less dense ones, and Galileo 
is usually credited with being the first to realize that even bodies with differing 
compositions, like wood and iron, fall at the same rate. Modern physicists would 
choose something like aluminum and gold, which have such different ratios of 
protons and neutrons, because, after all, we're really saying that everything, even 
each of the elementary particles, falls at the same rate! 

Newton, of course, would never have made Aristotle's first mistake, and
certainly realized that it was a question of whether bodies of differing compositions
fall at the same rate. Although Galileo had performed experiments to deter- 
mine this, Newton wanted much more accurate results, and he informs us of 
this right at the beginning of the Principia. 

Recall that the Principia begins with a definition that we previously showed 
with some material blocked out. 

Definition 1 Quantity of matter is a measure of matter that arises from its density and 
volume jointly. 
If the density of air is doubled in a space that is also doubled,
there is four times as much air, and there is six times as much if
the space is tripled. The case is the same for snow and powders
condensed by compression or liquefaction, .... Furthermore, I
mean this quantity whenever I use the term "body" or "mass" in
the following pages. [The remainder of this passage is shaded out.]



Adding in the remaining material we have 



Definition 1 Quantity of matter is a measure of matter that arises from its density and 
volume jointly. 

If the density of air is doubled in a space that is also doubled, 
there is four times as much air, and there is six times as much if 
the space is tripled. The case is the same for snow and powders 
condensed by compression or liquefaction, .... Furthermore, I 
mean this quantity whenever I use the term "body" or "mass" in
the following pages. It can always be known from a body's weight,
for, by making very accurate experiments with pendulums, I have
found it to be proportional to the weight, as will be shown below.



This was certainly a confusing place for Newton to place this remark (just as 
he is introducing the notion of mass, which he wants to distinguish from weight!), 
and it must have been all the more confusing when he added the phrase "as will 
be shown below". Normally, one would expect this phrase to refer to material 
at the bottom of the page, or perhaps just a few pages later on. But Newton's 
"shown below" actually refers to material about 400 pages later in his book! 

Newton's experiments involved pendulums, which was the 17th century high- 
tech way of avoiding friction. We'll consider the physics of the pendulum in 
greater detail later on (after all, a pendulum is basically a rigid body), but for 
now let's simply settle for the usual elementary physics analysis, which involves 
decomposing the gravitational force $F$ of magnitude $gm$ (where $g$ is the acceleration
due to gravity) on the bob into a force $F_1$ in the direction of the pendulum
string and another force $F_\theta$ tangent to the path of the bob. The string is also
exerting a force, $F_2$, on the bob, which is assumed to point along the direction
of the string. We must have $F_2 = -F_1$, since we assume that the bob stays at a
constant distance from the pivot point, keeping the string taut but not stretching
it out. Thus, the net force on the bob is $F + (-F_1) = F_\theta$, and consequently the
acceleration of the bob, tangent to the circular path, has magnitude

(1)    $a_\theta = g \sin\theta$.

If we consider $\theta$ as a function of time, and let $l$ be the length of the string
(the radius of the circle on which the pendulum bob moves), then equation (1)
yields

$\theta'' + \frac{g}{l} \sin\theta = 0.$

Physicists virtually always restrict their attention to the case of "small oscillations"
($\theta \approx 0$), so that they can replace $\sin\theta$ by $\theta$ and get an equation that they
can solve. But the information we need, and that Newton relied on, doesn't
require that consideration, and is a pure similarity argument.

For convenience, we choose the origin $O$ to be the point from which the
pendulum hangs. Then for any $\alpha > 0$ the path

$\gamma(t) = \alpha \cdot c(t/\sqrt{\alpha})$

follows a circle with radius $\alpha$ times the radius of the path $c$, but with the time
reparameterized by the factor $1/\sqrt{\alpha}$. So the angle $\vartheta(t)$ for $\gamma$ satisfies

$\vartheta(t) = \theta(t/\sqrt{\alpha}),$

and it follows that

$\vartheta'' + \frac{g}{\alpha l} \sin\vartheta = 0.$

This means that $\gamma$ gives the path of a pendulum bob with length $\alpha$ times that of
the original, and we easily conclude that the period of the pendulum described
by $\gamma$ is $\sqrt{\alpha}$ times the period of the pendulum described by $c$: the period of a
pendulum is proportional to the square root of its length.
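
The square-root law can also be checked numerically without any small-oscillation approximation. The following is only a minimal sketch in Python, not part of Newton's argument: it integrates the full pendulum equation with a standard fourth-order Runge-Kutta step and measures the period. The function name `pendulum_period`, the release angle of 1 radian, the step size, and the value 9.81 for the acceleration are all illustrative assumptions.

```python
import math


def pendulum_period(length, g=9.81, theta0=1.0, dt=1e-4):
    """Period of the full pendulum theta'' = -(g/length) * sin(theta).

    The bob is released from rest at angle theta0 (radians, 0 < theta0 < pi).
    By symmetry, the first time theta reaches 0 is a quarter of the period.
    """
    def accel(theta):
        return -(g / length) * math.sin(theta)

    theta, omega, t = theta0, 0.0, 0.0
    while True:
        # One RK4 step for the system theta' = omega, omega' = accel(theta).
        k1t, k1w = omega, accel(theta)
        k2t, k2w = omega + 0.5 * dt * k1w, accel(theta + 0.5 * dt * k1t)
        k3t, k3w = omega + 0.5 * dt * k2w, accel(theta + 0.5 * dt * k2t)
        k4t, k4w = omega + dt * k3w, accel(theta + dt * k3t)
        new_theta = theta + dt * (k1t + 2 * k2t + 2 * k3t + k4t) / 6
        new_omega = omega + dt * (k1w + 2 * k2w + 2 * k3w + k4w) / 6
        if new_theta <= 0.0:
            # Linear interpolation for the moment theta crosses zero.
            frac = theta / (theta - new_theta)
            return 4.0 * (t + frac * dt)
        theta, omega, t = new_theta, new_omega, t + dt


if __name__ == "__main__":
    t1 = pendulum_period(1.0)
    t4 = pendulum_period(4.0)
    # The ratio should be close to sqrt(4) = 2, even at this large amplitude.
    print(t4 / t1)
```

The large release angle is deliberate: the similarity argument makes no use of small oscillations, so the ratio of periods should be close to 2 for any fixed starting angle.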

A similar argument, left to you, shows that if the acceleration $g$ were replaced
by $g \cdot \alpha$, then the period of the pendulum would become $1/\sqrt{\alpha}$ times the original
period; so the periods $T_1$ and $T_2$ of a pendulum bob undergoing different
accelerations $g_1$ and $g_2$ are related by

(*)    $\frac{g_1}{g_2} = \frac{T_2^2}{T_1^2}.$

Newton phrased this result in a somewhat different way. If two objects, of
masses $m_1$ and $m_2$, have gravitational accelerations $g_1$ and $g_2$, and we denote
their weights by $W_1 = g_1 m_1$ and $W_2 = g_2 m_2$, then (*) can be written as

$\frac{m_1}{m_2} = \frac{W_1}{W_2} \cdot \frac{T_1^2}{T_2^2}.$

This result first occurs in the Principia about 300 pages from the beginning, a
mere 100 pages before Newton's description of his pendulum experiments, and 
I think it will be amusing to quote it as it appears: 



Proposition 24 In simple pendulums whose centers of oscillation are equally distant from
the center of suspension, the quantities of matter are in a ratio compounded 
of the ratio of the weights and the squared ratio of the times of oscillation 

For the velocity that a given force can generate in a given 
time in a given quantity of matter is as the force and the time 
directly and the matter inversely . . . Now if the pendulums are 
of the same length, the motive forces in places equally distant 
from the perpendicular are as the weights, and thus if two oscillating
bodies describe equal arcs and if the arcs are divided
into equal parts, then, since the times in which the bodies describe
single corresponding parts of the arcs are as the times of
the whole oscillations, the velocities in corresponding parts of
the oscillations will be to one another as the motive forces and
the whole times of the oscillations directly and the quantities of
matter inversely; and thus the quantities of matter will be as the
forces and the times of the oscillations directly and the velocities
inversely. But the velocities are inversely as the times, and
thus the times are directly, and the velocities are inversely, as 
the squares of the times, and therefore the quantities of matter 
are as the motive forces and the squares of the times, that is, as 
the weights and the squares of the times. Q.E.D. 



I haven't the slightest idea what any of this means! But I'm almost certain 
that it amounts to the similarity argument we have given. Aren't you glad that 
you aren't a mathematician of the 17th century!? 

About 100 pages later Newton describes how he tested

(*)    $\frac{g_1}{g_2} = \frac{T_2^2}{T_1^2},$

and thus the proportionality of weight to matter, with equal weights of "gold, 
silver, lead, glass, sand, common salt, wood, water, and wheat". Each pair 
of materials to be tested was enclosed within one of two rounded, equal-sized 
wooden boxes. For the wood bob he simply filled the inside of the box with 
more wood, but for the gold bob he suspended the gold at the center of the 
box; he then hung each of the two boxes by eleven-foot cords, which "made 
pendulums exactly like each other with respect to their weight, shape, and air 
resistance." 
He then started them swinging close to each other from the same height,
noting that "they kept swinging back and forth together with equal oscillations 
for a very long time. . . . And it was so for the rest of the materials. In these 
experiments, in bodies of the same weight, a difference of matter that would be 
even less than a thousandth part of the whole could have been clearly noticed." 

Of course, once we know that weight is exactly proportional to mass, there's 
no longer any need to measure mass accurately. To compare the mass of two 
objects, we simply have to compare their weights, and weights can easily be 
measured very accurately, with a balance scale. A balance scale is a special sort 
of lever, one in which the lever arms have the same length, but we don't need 
to understand the lever to use the balance scale, since symmetry implies that 
two objects that balance at the same distance from the pivot point must have 
the same weight (just to be on the safe side, we would probably interchange the 
two objects for a second check). 

Clever experiments have allowed Newton's result to be verified to much 
greater accuracy in later times, and you can read in physics books about the 19th 
century Eötvös experiment (see, for example, [Fr]) and the 1964 experiment of
Roll, Krotkov, and Dicke (described in [M-T-W]) which verified proportionality
of weight to mass within 1 part in $10^{11}$ for gold and aluminum. The only thing I
would like to point out here is that in both of these experiments, as in Newton's 
original experiment, we never actually measure the mass of anything! We only 
measure weights. 

At this point, I think I would have to reassure the students in our elementary 
physics class by saying: 

You may be starting to feel somewhat overwhelmed. First we insisted on 
distinguishing between mass and weight. Then we claimed that this distinction 
didn't really seem to matter, because weight was proportional to mass. Finally, 
we noted that we established this fact to great accuracy without even being able 
to measure mass to great accuracy! Don't worry! Everyone before Newton
was just as confused as you were! Newton was the first person to really make 
the distinction between mass and weight, and his decision to make mass and 
force the basic concepts in terms of which others should be defined — and to 
choose the first two laws as the basis for deducing other results — was one of 
Newton's main achievements. That's why we still speak of classical mechanics 
as Newtonian Mechanics. 

The first two laws, on which we have spent so much time, involve individual 
bodies, but say nothing about the interactions between different bodies. This 
information is given by Newton's third law, and since all of mechanics rests on 
Newton's three laws, this last one must be quite special. 

In fact, it is usually stated in a way that almost guarantees its misuse in philo- 
sophical and political discourse: 

Law 3 To any action there is always an opposite and equal reaction.

But Newton's statement was much more specific:

Law 3 To any action there is always an opposite and equal reaction; in other words, the actions 
of two bodies upon each other are always equal and always opposite in direction. 

Thus, the third law always involves two different bodies, each exerting a force 
on the other. Most misuses and invalid analogues of the third law ignore this 
basic fact that the two actions in question are exerted on two different bodies. 

Although I've made a snide comment about philosophers' and political the- 
orists' misuse of the third law, mathematicians and scientists themselves rarely 
appreciate the true significance of this law and its consequences. In a typical
first-year physics course the law is simply stated, with one or two examples, and
then earnestly applied to the solution of mechanics problems, with everyone
carefully drawing force diagrams to specify all the different reaction forces involved.
Almost no mention is ever made of the experimental evidence for this
law!

There is, however, one consequence of the third law that is always mentioned. 
Consider the collision of two objects: $B_1$, having mass $m_1$, and $B_2$, having
mass $m_2$. During the collision, $B_2$ will be exerting a force $F_{12}$ on $B_1$, while $B_1$
will be exerting a force $F_{21}$ on $B_2$ (the first subscript indicates the body on which
the force acts). We should really write $F_{12}(t)$ and $F_{21}(t)$ because these forces
may vary with time; in fact, they presumably vary in an incredibly complicated
way, depending on the particular way that the two bodies are compressed, spin,
vibrate, undulate, bobble, etc. But we always have $F_{12}(t) = -F_{21}(t)$, so for all
times $t$ we have

$(m_1 v_1 + m_2 v_2)'(t) = (m_1 v_1)'(t) + (m_2 v_2)'(t)$
$\qquad = F_{12}(t) + F_{21}(t)$
$\qquad = 0.$

Thus, no matter how complicated the collision may be, the "total momentum"
$m_1 v_1 + m_2 v_2$ is constant. Or, as we like to say, momentum is conserved.
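
To see how little the details of the force matter, one can simulate a collision on a line with a deliberately messy force law. The sketch below is only an illustration in Python: the function name `simulate_collision`, the body size `radius`, and the particular force law (which depends on both the overlap of the bodies and on time) are arbitrary assumptions, chosen to be complicated. The only physics built in is the second law and the relation that the force on body 1 is the negative of the force on body 2.

```python
import math


def simulate_collision(m1, m2, v1, v2, steps=20000, dt=1e-4):
    """Two bodies on a line; body 1 starts at x = 0, body 2 at x = 1.

    Returns (momenta, v1, v2): the total momentum recorded at every time
    step, and the final velocities of the two bodies.
    """
    x1, x2 = 0.0, 1.0
    radius = 0.1  # hypothetical body size: contact when x2 - x1 < 2 * radius
    momenta = []
    for n in range(steps):
        t = n * dt
        overlap = 2 * radius - (x2 - x1)
        if overlap > 0:
            # Arbitrary repulsive force on body 2 (an assumption, not a model
            # of any real material); it varies with compression and with time.
            f21 = 500.0 * overlap ** 1.5 * (1.0 + 0.5 * math.sin(300.0 * t))
        else:
            f21 = 0.0
        f12 = -f21  # third law: the force on body 1 is equal and opposite
        v1 += f12 / m1 * dt  # second law for body 1
        v2 += f21 / m2 * dt  # second law for body 2
        x1 += v1 * dt
        x2 += v2 * dt
        momenta.append(m1 * v1 + m2 * v2)
    return momenta, v1, v2


if __name__ == "__main__":
    momenta, v1_final, v2_final = simulate_collision(1.0, 3.0, 1.0, -0.5)
    print(min(momenta), max(momenta))  # both stay at the initial value -0.5
    print(v1_final, v2_final)          # the individual velocities do change
```

Replacing the force law by any other function of the overlap and the time changes the individual final velocities, but not the recorded total momentum, apart from rounding error.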

This result appears in the Principia as a Corollary of the three laws, and we 
will quote it here, together with the proof, just to give another example of the 
flavor of mathematical writing at that time: 

Corollary 3 The quantity of motion, which is determined by adding the motions made in 
one direction and subtracting the motions made in the opposite direction, is not 
changed by the action of bodies on one another. 

For an action and the reaction opposite to it are equal by law 3, 
and thus by law 2 the changes which they produce in motions are 
equal and in opposite directions. Therefore, if motions are in the 
same direction, whatever is added to the motion of the fleeing body 
will be subtracted from the motion of the pursuing body in such
a way that the sum remains the same as before. But if the bodies 
meet head-on, the quantity subtracted from each of the motions 
will be the same, and thus the difference of the motions made in 
opposite directions will remain the same. 

The separate consideration of the case of two bodies moving toward each 
other and the case of one body overtaking another seems awfully awkward to 
us, but, as we will soon see, Newton may have had a specific reason for making 
the distinction clear. 

This corollary of the third law, usually presented as a neat consequence in physics
classes, is actually the experimental evidence upon which Newton originally
relied. In fact his Scholium tells us:

Scholium The principles I have set forth are accepted by mathematicians and 
confirmed by experiments of many kinds. 



. . . Sir Christopher Wren, Dr. John Wallis, and Mr. Christiaan Huy- 
gens, easily the foremost geometers of the previous generation, inde- 
pendently found the rules of the collisions and reflections of hard 
bodies, and communicated them to the Royal Society at nearly the 
same time. 

This short sentence, off-handedly mentioning the near simultaneous commu- 
nication of the same results, has a bit of history behind it. 

Descartes seems to be the first person to have stated the law of Conservation 
of Momentum, offering the reasoning that since God made the Universe with 
a certain amount of momentum in it, obviously He would not allow this mo- 
mentum to be changed (I admit that those aren't his exact words, but they are 
not an unfair paraphrase; see [D], Chapter 4, section 3 for an exact statement). 
One might feel that although this would not be the sort of argument to be ac- 
cepted in modern physics journals, at least he did state the law. Unfortunately, 
however, Descartes didn't even state the law correctly, because he didn't think of 
momentum as a vector quantity, or even — for the case of bodies colliding along 
a straight line, the special case that people tended to restrict their attention to 
in those days — as a signed quantity, so he was really assuming that the abso- 
lute value of momentum was conserved. He then deduced from his principle 
a long series of consequences about colliding bodies, almost all of which are 
incorrect. Moreover, Descartes knew that his deductions didn't seem to accord 
with experiment, but blithely dismissed this as due to experimental errors! 
Naturally, not everyone was so convinced, so the Royal Society had asked for
research into the proper rules, and this was the impetus for the communications
of Wren, Wallis, and Huygens.

You might assume that, unlike Descartes, these three men simply carried 
out experiments to see what actually happened, but that's not quite the case. 
Newton goes on to say 

... Sir Christopher Wren, Dr. John Wallis, and Mr. Christiaan Huy- 
gens, easily the foremost geometers of the previous generation, inde- 
pendently found the rules of the collisions and reflections of hard 
bodies, and communicated them to the Royal Society at nearly the 
same time, . . . But Wren additionally proved the truth of these rules 
before the Royal Society by means of an experiment with pendulums 

Although Wallis and Huygens didn't do experiments, the "thought exper- 
iments" of Huygens happen to be extremely interesting, and we will consider 
them at a later time. But right now we want to briefly discuss those experiments 
with pendulums. 

As we've already mentioned, pendulums were the 17th century way of practi- 
cally eliminating friction, and they have an additional virtue: By observing how 
high a body swings past the lowest point, one can figure out the velocity that 
it had at that lowest point. Thus, Wren could determine both the velocities of
two bodies just as they collided at the bottom of their pendulum arcs and the
velocities that they obtained right after the collision.
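
The text does not say how a height is turned into a velocity. The standard relation, assumed here and not derived in these notes, is $v = \sqrt{2gh}$ for a bob that swings without friction through a height $h$; it is a consequence of conservation of energy. A minimal Python sketch, with the hypothetical function names `speed_at_bottom` and `height_from_angle` and the illustrative value 9.81 for $g$:

```python
import math


def speed_at_bottom(height, g=9.81):
    """Speed at the lowest point of a frictionless swing.

    `height` is the height above the lowest point from which the bob was
    released (or to which it rises afterwards). Uses v = sqrt(2 * g * h),
    which assumes no friction or air resistance.
    """
    return math.sqrt(2.0 * g * height)


def height_from_angle(length, angle):
    """Height above the lowest point for a bob on a string of the given
    length, displaced by `angle` (radians) from the vertical."""
    return length * (1.0 - math.cos(angle))


if __name__ == "__main__":
    # Hypothetical numbers: a string of length 3 displaced by 0.3 radians.
    h = height_from_angle(3.0, 0.3)
    print(h, speed_at_bottom(h))
```

The same relation works in both directions: the release height gives the velocity just before the collision, and the height reached on the rebound gives the velocity just after it.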

After mentioning Wren's experiments, Newton proceeds to give details of his 
own experiments, designed to check the results with much greater accuracy, and 
he spends three entire pages describing them (these experiments are completely 
different from the ones used to verify proportionality of weight and mass, which 
occur much later, though mentioned at the very beginning of the Principia). 

Newton is actually being rather modest in saying that "the principles I have 
set forth are accepted by mathematicians ..." because, as far as the third law 
is concerned, it had only appeared in this form, as conservation of momen- 
tum, and it was Newton who then worked backward to formulate his third law. 
Moreover, Newton took the truly audacious step of generalizing this law, based 
only on one particular physical phenomenon, the (totally unknown) forces in- 
volved in the collision of bodies, to arbitrary forces. One of the forces that 
Newton was particularly concerned about, of course, was gravity, but it will be 
quite interesting to first consider another force with which we are all familiar, 
namely, magnetism. 

I'd like to consider a simple experiment, but I'll have to be content with 
describing it, rather than performing it, because it would require an air-trough, 
as well as rather strong magnets, stronger than I could find at Tokyu Hands, 
and a few other bits of modern technology, like a strobe light. 
We simply place a magnet and a piece of iron on the air-trough, release
them, and make accurate measurements of their positions over very many small 
intervals of time. This allows us to determine their velocities at these times, and 
check that the total momentum at all such times is the same. 

But we can also make a convincing case without doing any measurements at 
all, by considering what happens when we release the magnet and the piece of 
iron simultaneously (so that they have total momentum 0 at that moment). I
would really like to try this experiment with a class of naive physics students, 
after having first shown some collisions when the two are not released simulta- 
neously, just to confuse the issue a bit. It would be interesting to present two 
different scenarios, one in which the magnet is small, and the piece of iron 
quite large, and one in which the situation is reversed, and ask students what 
they think will happen. I don't know, of course, because I haven't tried this 
(sociology) experiment, and the results would certainly depend on the sophis- 
tication of the students involved, but I wouldn't be surprised if many students 
would suspect that the heavier body would end up pushing the lighter one. 

Of course, once we start thinking along the lines of conservation of momen- 
tum, we see that when the magnet and iron collide, no matter what their relative 
masses are, they must come to a dead stop, since their total momentum must al- 
ways be the same as at the beginning, namely 0. Moreover, this one observation 
should suffice to convince us of conservation of momentum in this experiment, 
without making any of the intermediate measurements: Since the total momentum
at the moment of collision is 0, it must have been very close to 0 just before
the collision. But we can obtain a whole range of velocities just before collision
simply by varying the initial distance between the magnet and iron — or, equiv- 
alently by varying the strength of the magnet. Thus the total momentum must 
always be zero! 

This experiment could just as well have been carried out using two magnets, 
but I chose a magnet and a piece of iron to point out the unintuitive aspect of the 
third law, which I would emphasize to a physics class by another simple pair of 
experiments. 

Let us once again start with a magnet and a piece of iron held in place, and 
then simply release the iron, so that it hurtles toward the magnet. What does this 
experiment show? Well, obviously, that the magnet exerts a force on the iron; 
or, in more formal terms, that iron undergoes an acceleration in the presence of 
a magnet. Now we repeat the experiment, this time keeping the iron fixed, and 
releasing the magnet, which goes rushing toward the iron. Again we ask, what 
does this experiment show? Quite likely, many would say that it just shows the 
same thing, that a magnet exerts a force on the iron. But, of course, it doesn't 
show anything of the sort, since we don't observe the iron moving. What it 
shows is that iron exerts a force on the magnet. But we tend not to think in those 
terms. Normally we think of a piece of iron as a hunk of matter pretty much 
like any other piece. The magnet is something special, it exerts a force on the 
iron. But, in addition, the iron becomes special at the same time, something we
may not usually acknowledge. If we take one of those cute little magnets that 
people use to hold notes on a refrigerator door, and hold it near the refrigerator, 
we don't notice the refrigerator moving toward the magnet! Instead, we feel the 
magnet being pulled toward the refrigerator. Nevertheless, people don't usually 
go around saying that refrigerators attract magnets. What's truly amazing, of
course, is that not only does iron attract magnets, but it does so in exactly the 
right amount so that conservation of momentum holds. 

Remarkably enough, our modern understanding of magnetism happens to 
make this wonderful reciprocity quite understandable: the individual atoms of 
the iron each act as magnets, except that they are oriented randomly, and the 
magnet causes them to align, so that the iron now acts as a magnet also. So, 
ultimately, it's all a matter of iron atoms attracting each other, and we have a 
completely symmetric situation. 

That same argument, of course, would make it quite clear why two objects 
that have been given static electric charges should exert forces of equal magni- 
tude on each other, since ultimately it's all due to the mutual repulsive forces 
between electrons. 

Strangely enough, it's actually the most common example, the repulsive force 
of colliding bodies, that now seems the most mysterious. Presumably this in- 
stance of the third law has to reduce to the third law operating on any pair of 
particles. So it must hold, for example, for a proton and a neutron, where the 
situation seems decidedly unsymmetric. In fact, it starts to be clear that if one 
really understood the third law, one would basically understand all of atomic 
physics. 

I should mention that the little experiment involving a magnet and piece of 
iron that I have concocted, and which I feel would make a great teaching tool, 
was inspired by something Newton said, but hardly something for which I would 
praise him; in fact, it may be the silliest thing that Newton ever said (or, at any 
rate, the silliest scientific statement he ever made). 

After his description of his pendulum experiments, which involved the repulsive 
force of collisions, Newton also wanted to say something about attractive forces 
(since he had gravity in mind). So after three pages describing his careful ex- 
perimentation, he immediately adds the following paragraph, for which I have 
provided a quick picture: 



[Figure: two bodies that attract each other, separated by an interposed object]

I demonstrate the third law of motion for attractions briefly as follows.
Suppose that between any two bodies A and B that attract each other
any obstacle is interposed so as to impede their coming together. If one
body A is more attracted toward the other body B than that other body B
is attracted toward the first body A, then the obstacle will be more strongly 
pressed by body A than by body B and accordingly will not remain in 
equilibrium. The stronger pressure will prevail and will make the system 
of the two bodies and the obstacle move straight forward in the direction 
from A toward B and, in empty space, go on indefinitely with a motion 
that is always accelerated, which is absurd and contrary to the first law of 
motion. 

Thus, after three pages of careful experiment, Newton provides a one para- 
graph theoretical argument, and this argument is patently nonsense! The first 
law is concerned with the force on one body, not on a "system" consisting of 
more than one body. Moreover, the whole argument depends on the fact that 
the "interposed" object is rigid, so that it keeps A and B separated, and as you 
might imagine, our analysis of rigid bodies will presuppose the third law. Fi- 
nally, we might note that the same argument could just as well be made to work 
for repulsive forces: 



What's even more amazing is that Newton actually described an experiment 
made to test this idea, using vessels floating on water instead of an air-trough 
to reduce friction: 

I have tested this with a lodestone and iron. If these are placed in separate 
vessels that touch each other and float side by side in still water, neither one 
will drive the other forward, but because of the equality of the attraction in 
both directions, they will sustain their mutual endeavors toward each other, 
and at last, having attained equilibrium, they will be at rest. 

It seems reasonable to assume that Newton did indeed actually perform this 
experiment, in view of his careful description of so many other experiments, but 
it must be one of the weirdest negative experiments ever performed. Did anyone
really think that the two vessels would go zooming off in one direction?

Although this episode must be written off as a strange anomaly in Newton's 
thought, simply removing the vessels that separate the magnet and iron suggests 
the more interesting experiment that we have performed before, and, I hasten 
to add, later lectures will confirm the brilliance of Newton's thought, and show 
how far ahead of all his contemporaries he was. 

FURTHER REMARKS
ON THE FUNDAMENTALS 



In the previous lecture I pointed out that our operational definition of mass 
didn't seem to provide any clear reason why mass should be additive, and chal- 
lenged people to find a reasonable explanation of this phenomenon. That was 
a challenge that I myself wasn't prepared to meet at the time, but I think the 
following would be a good answer. 

First we would need experimental evidence establishing, in effect, that "forces 
are additive". Suppose that our cart needs to be given the acceleration $a_1$ in
order to compress the spring to length $L_0$ when some object is placed in the
carriage of the air-trough. And now suppose that instead of inserting the single
spring, we insert two springs of the same construction. What we will find is that
the cart must be given the acceleration $a_2 = 2a_1$ in order for the two springs to
be compressed simultaneously to the length $L_0$. Although this might seem clear
if you think in terms of the usual $F = ma$ law, it is basically an experimental
fact that we have to test, along with similar tests for three springs, four springs,
etc.

[Figure: the cart and the carriage]

Another way of expressing this is to say that the mass of our body can be
determined by

$m = \frac{a_0}{a_2/2}$

when we use two springs instead of one on an object whose mass we are trying
to determine ($a_0$ is still the convenient acceleration that we used on our "unit
mass" to determine $L_0$, with just one spring).

On the other hand, suppose that we place two copies of our object on the 
carriage, with one spring behind each. Presumably the cart will require an 
acceleration of $a_1$ to compress the two springs to length $L_0$, since we can simply
think of this as two copies of the original experiment carried out side-by-side.
If we now think of this as two springs behind the one object consisting of two
copies of our original object, it follows that the mass of this new object is just

$\frac{a_0}{a_1/2} = 2\,\frac{a_0}{a_1} = 2m,$

and of course we could just as well repeat the argument for other multiples, and 
eventually reason our way to the general rule. 
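
The bookkeeping in this argument is easy to lose track of, so here is a small Python sketch of it. The function name `mass_from_accelerations` and the sample numbers are hypothetical. The function simply encodes the operational definition together with the experimental fact assumed above, that several identical springs require a proportionally larger acceleration to reach the same compressed length.

```python
def mass_from_accelerations(a0, a, springs=1):
    """Operational mass of an object on the carriage.

    a0      -- acceleration that compresses one spring to the reference
               length when the unit mass is on the carriage
    a       -- acceleration needed to compress all the springs to the
               reference length with the object in question
    springs -- number of identical springs behind the object

    Assumes (as an experimental fact) that n springs need n times the
    acceleration that one spring needs.
    """
    return a0 / (a / springs)


if __name__ == "__main__":
    a0, a1 = 6.0, 2.0  # hypothetical measured accelerations
    m = mass_from_accelerations(a0, a1)                     # one object, one spring
    m_again = mass_from_accelerations(a0, 2 * a1, springs=2)  # same object, two springs
    m_double = mass_from_accelerations(a0, a1, springs=2)     # two copies, two springs
    print(m, m_again, m_double)
```

With these numbers the single object has the same mass whether it is measured with one spring or two, and the doubled object comes out with twice that mass, which is the additivity being argued for.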

I'd also like to examine the "thought experiments" that Huygens adduced 
in support of conservation of momentum, not only as an interesting historical 
curiosity, but also because it connects rather closely to the whole notion of 
"symmetry", which gets rather abused in modern thought. 

First consider two identical bodies, say two steel balls, moving toward each 
other with equal speeds, i.e., with velocities $v$ and $-v$. In this simple situation it
is obviously reasonable to assume, on the basis of symmetry, that their rebound
velocities will also be negatives of each other, $w$ and $-w$, so that conservation
of momentum holds: it is 0 both before and after the collision.

Now let us imagine the same experiment as observed in a coordinate system 
that is moving with uniform velocity u with respect to us, like a boat moving 
with respect to the shore, to take Huygens' example. ([M], Chapter III, section 
IV contains a reproduction of the delightfully quaint illustration that appears 
in Huygens' book "De Motu Corporum ex Percussione" of 1703; this figure 
also appears in [Fr], which gives a fairly detailed description of the following
argument.) In this coordinate system, the objects are moving with the initial
velocities

$v_1 = v + u$ and $v_2 = -v + u$,

while their rebound velocities are

$w_1 = w + u$ and $w_2 = -w + u$,

so $v_1 + v_2 = w_1 + w_2$ $(= 2u)$. Since we can obtain any pair $v_1$, $v_2$ by choosing the
appropriate u and v, we find that in the coordinate system of the boat, moving 
uniformly with respect to the shore, conservation of momentum holds for two 
identical bodies approaching each other with arbitrary velocities. Of course, we 
could just as well interchange the role of the boat and the observer on shore, to 
reach the same conclusion for our observer on shore. 
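
Huygens' change of coordinates can be mimicked in a few lines of Python. This is only a sketch under stated assumptions: the function name `rebound_in_any_frame` is hypothetical, and the rule `symmetric_rebound`, which says what happens in the symmetric situation, is supplied by the caller because symmetry alone does not determine the rebound speed.

```python
def rebound_in_any_frame(v1, v2, symmetric_rebound):
    """Rebound velocities of two identical bodies with velocities v1 and v2.

    symmetric_rebound(v) must return w, the rebound velocity of the first
    body when the two bodies approach with velocities v and -v; by symmetry
    the second body then rebounds with velocity -w.
    """
    u = (v1 + v2) / 2.0  # velocity of the frame in which the collision is symmetric
    v = (v1 - v2) / 2.0  # in that frame the bodies have velocities v and -v
    w = symmetric_rebound(v)
    return w + u, -w + u  # transform the symmetric rebound back


if __name__ == "__main__":
    # Hypothetical rule: the bodies bounce back with 70% of their speed.
    w1, w2 = rebound_in_any_frame(3.0, -1.0, lambda v: -0.7 * v)
    print(w1 + w2)  # agrees with 3.0 + (-1.0) up to rounding
```

Whatever rule is passed in, the sum of the rebound velocities equals the sum of the initial velocities, since both are $2u$.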

Rather than following the succeeding course of Huygens' arguments, we will 
add some considerations from The Feynman Lectures on Physics, Volume 1 
[Fey]. Let's use steel cubes for convenience, and suppose that glue has been 
applied to opposing faces so that they will stick together when they meet. Sym- 
metry dictates that when they approach each other with the same velocity and 
then stick together, they will end up at rest, so that conservation of momentum
holds. Huygens' argument then implies that a collision with initial velocities $v_1$
and $v_2$ results in a "double cube" moving with velocity $\frac{1}{2}(v_1 + v_2)$.

We can apply these results to the case of one cube with velocity $v_1$ colliding
with two other cubes that have glue applied to opposing faces, but are moving
in tandem, separated by a tiny distance, with velocity $v_2$.

[Figure: one cube with velocity $v_1$ approaching two cubes moving in tandem with velocity $v_2$]

Immediately after cube 1 and cube 2 collide, they have velocities $w_1$ and $w_2$ satisfying

(a)    $w_1 + w_2 = v_1 + v_2.$

A moment later, cube 2 collides with and sticks to cube 3, after which the
resulting double cube moves with velocity $w_{12}$ satisfying

(b)    $2w_{12} = w_2 + v_2.$

It follows that

initial total momentum $= m v_1 + 2m v_2$
    $= m(v_1 + v_2) + m v_2$
    $= m(w_1 + w_2) + m v_2$    by (a)
    $= m w_1 + m(w_2 + v_2)$
    $= m w_1 + 2m w_{12}$    by (b)
    $=$ final total momentum.

Imagining the tiny distance decreased to 0, we conclude that conservation of 
momentum holds for a collision of a cube with a double cube, and we can easily 
generalize the argument for any multiple cube colliding with any other. 
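
The two-step argument can be replayed numerically. In the Python sketch below (an illustration only, with the hypothetical function name `cube_hits_double_cube`), the rebound velocity `w1` of cube 1 is a free input, because relation (a) fixes only the sum of the two velocities after the first collision. The other two velocities are then forced by (a) and (b).

```python
def cube_hits_double_cube(m, v1, v2, w1):
    """Momentum before and after a cube hits two cubes moving in tandem.

    m  -- mass of each cube
    v1 -- initial velocity of cube 1
    v2 -- initial velocity of cubes 2 and 3
    w1 -- velocity of cube 1 after it hits cube 2 (a free, hypothetical input)

    Returns (initial_momentum, final_momentum).
    """
    w2 = v1 + v2 - w1      # relation (a): w1 + w2 = v1 + v2
    w12 = (w2 + v2) / 2.0  # relation (b): 2 * w12 = w2 + v2
    initial = m * v1 + 2 * m * v2
    final = m * w1 + 2 * m * w12
    return initial, final


if __name__ == "__main__":
    for w1 in (-1.0, 0.0, 0.25, 2.0):
        print(cube_hits_double_cube(1.5, 2.0, -0.5, w1))
```

For every choice of `w1` the two returned numbers agree up to rounding, which is exactly what the chain of equalities asserts.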

This clever argument might require supplementary considerations to deal
with steel cubes stacked differently, let alone with objects of arbitrary shape,
but the real problem is that it applies only to two objects made of the same
"homogeneous" material. As soon as we consider objects made of different
materials we are at an impasse. Given a steel cube and an aluminum cube of
the same mass, approaching each other with velocities $v$ and $-v$, no symmetry
argument allows us to conclude that the rebound velocities are also negatives
of each other. Having the same mass simply means that they are given the
same acceleration by a given force; it says nothing about why they should react
symmetrically in this situation. And that is the whole mystery of the third law.

The third law seems so reasonable, and is accepted so uncritically by physics 
students, precisely because it rather misleadingly seems to be simply a statement 
of symmetry (and philosophers and political theorists appeal to it as expressing 
a sort of moral symmetry).

I want to conclude this lecture with one seemingly minor point about the
fundamentals of mechanics that has so far been omitted. Newton's statement 
concerning conservation of momentum, previously quoted, was 

Corollary 3 The quantity of motion, which is determined by adding the motions made in 
one direction and subtracting the motions made in the opposite direction, is not 
changed by the action of bodies on one another. 

while Corollary 1, which we skipped, says 

Corollary 1 A body acted on by [two] forces acting jointly describes the diagonal of a 
parallelogram in the same time in which it would describe the sides if the forces 
were acting separately. 

Nowadays the vector space structure of $\mathbb{R}^n$ is so ubiquitous and appears so
natural that we might unhesitatingly aver that the effect of the forces $\mathbf{F}_1$ and
$\mathbf{F}_2$ acting simultaneously must simply be the standard vector sum $\mathbf{F}_1 + \mathbf{F}_2$. But
that can't simply follow from the fact that we've decided to represent forces by
vectors! In fact, the whole reason for introducing vector addition in the first
place was because it represented the "addition" of forces.

As a matter of fact, we can say, without even reading Newton's proof, that it 
can't possibly be correct, since, after all, the three laws on which it is suppos- 
edly based are concerned with single forces acting on bodies — they simply say 
nothing at all about two forces acting simultaneously. 

When we do look at Newton's proof, this is what we find: 

Let a body in a given time, by force M alone impressed
in A, be carried with uniform motion from
A to B, and, by force N alone impressed in the same
place, be carried from A to C; then complete the parallelogram
ABDC, and by both forces the body will be carried in the same
time along the diagonal from A to D. For, since force N acts along the line 
AC parallel to BD, this force, by law 2, will make no change at all in the 
velocity toward the line BD which is generated by the other force. There- 
fore, the body will reach the line BD in the same time whether force N is 
impressed or not, and so at the end of that time will be found somewhere 
on the line BD. By the same argument, at the end of the same time it will 
be found somewhere on the line CD, and accordingly it is necessarily found 
at the intersection D of both lines. And, by law 1, it will go with [uniform] 
rectilinear motion from A to D.

Even before we reach any questionable steps, we see from the very first phrases
that Newton is framing this proof in terms of impulsive forces, since he states 
that the forces M and N individually produce a uniform motion on the object. 
Moreover, at the very end of the argument he implicitly assumes that the combi- 
nation of the two impulsive forces must also be an impulsive force, so that the object 
moves with uniform motion from the initial point A to the final point D. The 
remaining part of the argument is the most dubious of all, with its argument 
that the force N "will make no change at all in the velocity toward the line BD 
which is generated by the other force". 

In defense of Newton, we ought to unveil the material that was hidden in our 
original presentation of the second law: 

Law 2 A change in motion is proportional to the motive force impressed and takes place along 
the straight line in which that force is impressed. 

If some force generates any motion, twice the force will generate twice 
the motion, .... And if the body was previously moving, the new mo- 
tion (since motion is always in the same direction as the generative force) 
is added to the original motion if that motion was in the same direction 
or is subtracted from the original motion if it was in the opposite di- 
rection or, if it was in an oblique direction, is combined obliquely and 
compounded with it according to the directions of both motions. 

This almost sounds like a statement of Corollary 1. Actually, it will also help 
to go back to Newton's Scholium, and unveil the material that was deleted there, 
when we quoted his acknowledgment of Galileo: 

Scholium The principles I have set forth are accepted by mathematicians and 
confirmed by experiments of many kinds. By means of the first two 
laws and the [first corollary] Galileo found that the descent of heavy 
bodies is in the squared ratio of the time and that the motion of pro- 
jectiles occurs in a parabola, as experiment confirms, except insofar 
as these motions are somewhat retarded by the resistance of the air. 

Newton even provides a little picture in the discussion that follows a bit later: 

For example, let body A by the motion of projection alone describe
the straight line AB in a given time, and by the
motion of falling alone describe the vertical distance
AC in the same time; then complete the
parallelogram ABDC, and by the compounded
motion the body will be found in place D at the
end of the time; and the curved line AED which
the body will describe will be a parabola which
the straight line AB touches at A and whose ordinate
BD is as $AB^2$.

Here, of course, we are considering, on the one hand, an impulsive force,
which gives the object its uniform horizontal motion, and, on the other hand, 
the force of gravity, which gives the object its non-uniform vertical motion. And 
indeed this really illustrates only that the action of a force on an object is inde- 
pendent of the object's uniform velocity, which was Galileo's basic observation. 

Basically, Newton is saying that Corollary 1 holds if we think of one of the two 
forces having already been applied, and then trying to sucker us into concluding 
that it holds when they are applied simultaneously. Of course, from a modern
quantum mechanical point of view this might seem extremely reasonable, because
each force is presumed to come about by interactions with myriad
special particles, and one could ignore the infinitesimal probability of two such
interactions occurring at exactly the same time.

But if we stick to the classical picture, then from a strictly logical point of view,
it is not even clear that two forces $\mathbf{F}_1$ and $\mathbf{F}_2$ acting simultaneously should have
the same effect as any other single force $\mathbf{F}$: while it's true that the combined
forces must end up producing an acceleration of some sort on each object, that
acceleration might not be proportional to the mass of the object, even though
the accelerations produced by $\mathbf{F}_1$ and $\mathbf{F}_2$ individually are.

Physicists nowadays seem resigned to the stance of regarding the parallelo- 
gram law as just another law based on observation, and mechanisms like this 
may be used to illustrate it in classroom settings. 




In [M], Chapter I, section III, there is a picture of a much more elaborate 
mechanism for illustrating the parallelogram law, but even it would probably 
only yield one or two decimals of accuracy. Since the parallelogram law is 
presumably an experimental fact, and since virtually everything in physics de- 
pends on it, one might expect it to be tested to great precision, like the precise 
experiments to test the proportionality of weight and mass. 

But no one ever mentions such experiments! I think that's probably because
everyone thinks that somehow the result must really be a theorem, and Newton
is certainly not the only one who tried to present it as such. Numerous mathematicians
(and I mean real mathematicians, not circle-squarers) have provided
"proofs". The first was Bernoulli (though it was Jean Bernoulli, not his more
famous brother, of the Bernoulli numbers), followed by Laplace (of the Laplace
transform) in his great work [L], and by Poisson (of the Poisson integral) in [P]
and by Hamilton (of Hamiltonian mechanics and the Hamilton-Jacobi equation)
in [H].

If you try to read these proofs — supposedly mathematical proofs of a non- 
mathematical fact — you will see that they are all shrouded in a somewhat im- 
penetrable veil of unstated assumptions, making it all the harder to read them, 
and of course in those days even mathematical results were stated in strange 
ways, and the proofs are nowadays hard to read. Since the proofs, no matter 
how complicated, or how elegant, all share the same fatal flaws, I'll give pride 
of place to Bernoulli's proof, and present the main idea, based on the account 
in [M] (not bothering with some details, since the only point of presenting it is 
to demolish it). 

We restrict our considerations to $\mathbb{R}^2$, so that we are only treating forces in
one plane. As we have already indicated, the first basic assumption is that two
forces $\mathbf{v}$, $\mathbf{w}$ acting together have the same effect as some other force. Thus, we
are assuming that for each pair $\mathbf{v}, \mathbf{w} \in \mathbb{R}^2$ we have another element $\mathbf{v} \oplus \mathbf{w} \in \mathbb{R}^2$.
We presumably shouldn't object to assuming that $\mathbf{v} \oplus \mathbf{w} = \mathbf{w} \oplus \mathbf{v}$, and also that
$\mathbf{v} \oplus \mathbf{v} = 2\mathbf{v}$ (an equation inherent in the very discussion at the beginning of
this lecture). More generally, of course, we could assume that any $k$-fold sum
$\mathbf{v} \oplus \cdots \oplus \mathbf{v}$ makes sense, and has the value $k\mathbf{v}$, without worrying about the order
in which the operations $\oplus$ are performed, but for the moment we will leave
aside the question of what other assumptions would be reasonable. Our goal is
to show that $\mathbf{v} \oplus \mathbf{w}$ is simply the usual vector sum $\mathbf{v} + \mathbf{w}$.

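We begin by considering two perpendicular vectors, $\mathbf{v}_1$ of length $a$, and $\mathbf{v}_2$ of
length $b$, and let $x$ be the length of $\mathbf{v}_1 \oplus \mathbf{v}_2$. Let $\mathbf{w}_2$ be the vector on the line
perpendicular to $\mathbf{v}_1 \oplus \mathbf{v}_2$ with length $ab/x$, and let $\mathbf{w}_1$ be the vector along
$\mathbf{v}_1 \oplus \mathbf{v}_2$ of length $a^2/x$. We now have

$$
\begin{aligned}
\text{length } \mathbf{w}_1 &= \tfrac{a}{x} \cdot \text{length } \mathbf{v}_1 \\
\text{length } \mathbf{w}_2 &= \tfrac{a}{x} \cdot \text{length } \mathbf{v}_2 \\
\text{length } \mathbf{v}_1 &= \tfrac{a}{x} \cdot \text{length}(\mathbf{v}_1 \oplus \mathbf{v}_2),
\end{aligned}
$$

and the angle $\alpha$ from $\mathbf{w}_2$ to $\mathbf{v}_1$ equals the angle from $\mathbf{w}_1$ to $\mathbf{v}_2$. This means
that there is an orthogonal map $T$ (involving a rotation through the angle $\alpha$,
together with a reflection) such that

$$
\begin{aligned}
\tfrac{a}{x}\, T(\mathbf{v}_1) &= \mathbf{w}_1 \\
\tfrac{a}{x}\, T(\mathbf{v}_2) &= \mathbf{w}_2 \\
\tfrac{a}{x}\, T(\mathbf{v}_1 \oplus \mathbf{v}_2) &= \mathbf{v}_1.
\end{aligned}
$$

From this we conclude that

$$\mathbf{w}_1 \oplus \mathbf{w}_2 = \mathbf{v}_1. \tag{1}$$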



Here we are using the assumption, almost never explicitly made, that we
should have $T(\mathbf{v} \oplus \mathbf{w}) = T(\mathbf{v}) \oplus T(\mathbf{w})$ for all orthogonal maps $T$, but this is a
perfectly reasonable assumption to add, as it merely expresses our experience
that the laws of physics seem to be independent of orientation.

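Now, similarly, we consider vectors $\mathbf{z}_1$ and $\mathbf{z}_2$ of lengths $b^2/x$ and $ab/x$,
respectively, and conclude that we have

$$\mathbf{z}_1 \oplus \mathbf{z}_2 = \mathbf{v}_2. \tag{2}$$

Since $\mathbf{w}_2 = -\mathbf{z}_2$, equations (1) and (2) give

$$\mathbf{w}_1 \oplus \mathbf{z}_1 = \mathbf{v}_1 \oplus \mathbf{v}_2. \tag{3}$$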

But $\mathbf{w}_1$ and $\mathbf{z}_1$ lie along $\mathbf{v}_1 \oplus \mathbf{v}_2$, so the length of $\mathbf{v}_1 \oplus \mathbf{v}_2$ is the sum of the lengths
of $\mathbf{w}_1$ and $\mathbf{z}_1$, which means that

$$\frac{a^2}{x} + \frac{b^2}{x} = x, \qquad a^2 + b^2 = x^2, \qquad x = \sqrt{a^2 + b^2},$$

and thus $\mathbf{v}_1 \oplus \mathbf{v}_2 = a\mathbf{e}_1 \oplus b\mathbf{e}_2$ has length $\sqrt{a^2 + b^2}$.

In other words, the length of $a\mathbf{e}_1 \oplus b\mathbf{e}_2$ is the same as the length of $a\mathbf{e}_1 + b\mathbf{e}_2$,
and Bernoulli has thus demonstrated that $\mathbf{v}_1 \oplus \mathbf{v}_2$ has precisely the length you
would expect it to have, in the special case that $\mathbf{v}_1$ and $\mathbf{v}_2$ are perpendicular.
He then proceeds by involved arguments to prove the complete result, for the
general case.
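
The construction can be checked numerically, with one important caveat: the sketch below uses the ordinary vector sum, which is exactly what the argument is trying to establish, so it only confirms that the auxiliary vectors are consistent with the expected answer and proves nothing. The coordinates (the first force along the first axis, the second along the second axis), the side chosen for the perpendicular vector, and the function names are assumptions made for illustration.

```python
import math

def bernoulli_decomposition(a, b):
    """Build w1, w2, z1, z2 for v1 = (a, 0), v2 = (0, b), taking x = sqrt(a^2 + b^2)."""
    x = math.hypot(a, b)
    u = (a / x, b / x)        # unit vector along v1 + v2
    n = (b / x, -a / x)       # unit vector perpendicular to u (side chosen for w2)
    w1 = (a * a / x * u[0], a * a / x * u[1])   # length a^2/x, along v1 + v2
    w2 = (a * b / x * n[0], a * b / x * n[1])   # length ab/x, perpendicular to v1 + v2
    z1 = (b * b / x * u[0], b * b / x * u[1])   # length b^2/x, along v1 + v2
    z2 = (-w2[0], -w2[1])                       # z2 = -w2
    return w1, w2, z1, z2

def add(p, q):
    """Ordinary componentwise vector sum."""
    return (p[0] + q[0], p[1] + q[1])

if __name__ == "__main__":
    w1, w2, z1, z2 = bernoulli_decomposition(3.0, 4.0)
    print(add(w1, w2))   # compare with v1 = (3, 0)
    print(add(z1, z2))   # compare with v2 = (0, 4)
    print(add(w1, z1))   # compare with v1 + v2 = (3, 4)
```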

The one point that is usually ignored is that in our quick trip from equations
(1)-(2) to (3), we had to use associativity of $\oplus$, which is likewise used in all the
other proofs that have been fashioned. But if we assume associativity of $\oplus$, then
everything is essentially trivial: Consider the map

$$(a, b) = a\mathbf{e}_1 + b\mathbf{e}_2 \mapsto a\mathbf{e}_1 \oplus b\mathbf{e}_2.$$

If $\oplus$ is associative then this map will be linear. But it takes $\mathbf{e}_1$ to $\mathbf{e}_1$ and $\mathbf{e}_2$ to $\mathbf{e}_2$,
so it must be the identity. Q.E.D.

So in the end, I really don't know what to say about the parallelogram law. 
I think we do have to resort to the modern view that it is an experimental fact, 
and then just wonder why no one has ever done an experiment to test it!

HOW NEWTON ANALYZED
PLANETARY MOTION

After the "Definitions" and "Axioms" sections of the Principia, we get to 
Book 1, "The Motion of Bodies". This begins with a preliminary section that 
basically treats elements of calculus in a geometric guise, and then Newton im- 
mediately starts the next section with "Kepler's second law", though he doesn't 
mention Kepler's name. 

This law, actually the first that Kepler discovered, says that the radius vector
of a planet sweeps out equal areas in equal times, or equivalently that the area
swept out in time $t$ is proportional to $t$.




Newton pointed out that this is a consequence of the fact that the gravitational
force that the sun produces on the planet is always directed along the line from
the planet to the sun, or equivalently, that the acceleration of the planet is always
directed toward the sun; the specific magnitude of this force is irrelevant.
Or as Newton expressed it,

Proposition 1. The areas which bodies made to move in orbits describe by radii 
drawn to an unmoving center of forces lie in unmoving planes and are proportional 
to the times. 

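This turns out to be extremely easy to prove analytically, especially if we use
the cross product of vectors. For simplicity assume that our force is always
directed toward the origin $O$, and let $\mathbf{c}$ be any particle. We always have

$$(\mathbf{c} \times \mathbf{v})' = (\mathbf{c} \times \mathbf{v}') + (\mathbf{c}' \times \mathbf{v}) = (\mathbf{c} \times \mathbf{v}') + (\mathbf{v} \times \mathbf{v}) = \mathbf{c} \times \mathbf{v}',$$

so if $\mathbf{v}'$ points along $\mathbf{c}$, we just get $(\mathbf{c} \times \mathbf{v})' = 0$, and consequently the relation

$$\mathbf{c} \times \mathbf{v} = \mathbf{w}, \qquad \mathbf{w} \text{ a constant vector.} \tag{*}$$

If $\mathbf{w} = 0$, then $\mathbf{v}(t)$ always points along the line from $O$ to $\mathbf{c}(t)$, and our
particle must simply be moving along a straight line towards $O$. If $\mathbf{w} \ne 0$, then,
since the inner product satisfies

$$0 = \langle \mathbf{c}(t) \times \mathbf{v}(t), \mathbf{c}(t) \rangle = \langle \mathbf{w}, \mathbf{c}(t) \rangle,$$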

we see immediately that $\mathbf{c}(t)$ always lies in one plane.

Moreover, $\mathbf{c}(t) \times \mathbf{v}(t)$ has a natural interpretation in terms of the area swept
out by the radius vector. For small $h$, this area, $S(t+h) - S(t)$, is approximately
the area of the shaded triangle, and thus approximately

$$\tfrac12 \bigl| \mathbf{c}(t) \times [\mathbf{c}(t+h) - \mathbf{c}(t)] \bigr|.$$

Consequently, in the limit we have

$$S'(t) = \tfrac12 \lim_{h \to 0} \left| \mathbf{c}(t) \times \frac{\mathbf{c}(t+h) - \mathbf{c}(t)}{h} \right| = \tfrac12 \bigl| \mathbf{c}(t) \times \mathbf{v}(t) \bigr|.$$

Thus, (*) implies that $S'(t)$ is constant, or that $S(t)$ is proportional to $t$.
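We can also simply write $\mathbf{c}$ as

$$\mathbf{c}(t) = (R(t) \cos\theta(t),\; R(t) \sin\theta(t)),$$

compute $\mathbf{v} = \mathbf{c}'$ and observe that

$$|\mathbf{c} \times \mathbf{v}| = R^2\theta',$$

and $\tfrac12 R^2\theta'$ is just the integrand required to compute areas in polar coordinates.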
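
For readers who want the intermediate step, the computation in polar coordinates runs as follows. Here the cross product of two plane vectors is identified with its single nonzero component, and $\theta' > 0$ is assumed so that no absolute value is needed; both conventions are choices made for this sketch.

```latex
\begin{aligned}
\mathbf{v} = \mathbf{c}' &= (R'\cos\theta - R\theta'\sin\theta,\; R'\sin\theta + R\theta'\cos\theta),\\
\mathbf{c}\times\mathbf{v} &= R\cos\theta\,(R'\sin\theta + R\theta'\cos\theta) - R\sin\theta\,(R'\cos\theta - R\theta'\sin\theta)\\
&= R^2\theta'(\cos^2\theta + \sin^2\theta) = R^2\theta'.
\end{aligned}
```

The terms involving $R'$ cancel, which is why only the angular rate survives.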

Newton's proof is completely different. It is a geometric proof, approximating
the curve by a polygon, and it is not only simple, but it also seems to show
just why the proposition is true.

Newton uses the following diagram in his proof,
but we only need to consider a small portion of it:

[Figure: the relevant portion of Newton's diagram, with the points S, A, B, C, c and V.]

Newton assumes that the particle follows the path ABCD . . . , receiving "impulsive"
forces at short equal intervals of time, and that these impulsive forces
at B, C, . . . are always directed toward S, so that the path sweeps out the triangular
areas △SAB, △SBC, .... Newton merely has to point out that, if not
for the impulsive force at B, the particle would move to c, with Bc = AB. In
this case, it would sweep out the triangle △SBc, which has the same area as
△SAB (since they have equal bases, and the same height). The impulsive force
applied at B will instead send the particle to C, which will be at the end of the diagonal
of the parallelogram formed by Bc and a line BV pointing along SB, since we
are assuming that the force is directed toward S. This means that Cc is parallel
to BV, and this in turn means that △SBc has the same area as △SBC (since
these triangles have the common base SB and the same height above that base).
In short, the area of △SAB is the same as the area of △SBC, and so on, all
along the path!
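
Newton's polygonal picture is easy to imitate on a computer. The following minimal sketch places S at the origin and alternates free flight for a time `h` with an impulse directed toward S; the function name `polygon_areas`, the parameter `strength` (which gives the size of the impulse as a function of distance), and the sample force laws are all assumptions chosen for illustration. The point of the experiment is that the triangle areas come out equal whatever `strength` is, just as the argument claims.

```python
import math

def polygon_areas(pos, vel, h, steps, strength):
    """Areas of the successive triangles SAB, SBC, ... with S at the origin.

    The particle moves freely for time h, then receives an impulse directed
    toward S whose size is strength(r) * h, where r is its distance from S.
    """
    x, y = pos
    vx, vy = vel
    areas = []
    for _ in range(steps):
        nx, ny = x + h * vx, y + h * vy            # free flight to the next vertex
        areas.append(0.5 * abs(x * ny - y * nx))   # triangle S, old vertex, new vertex
        x, y = nx, ny
        r = math.hypot(x, y)
        kick = strength(r) * h                     # velocity change, pointing toward S
        vx -= kick * x / r
        vy -= kick * y / r
    return areas

if __name__ == "__main__":
    for law in (lambda r: 1.0 / r**2, lambda r: r):
        areas = polygon_areas((1.0, 0.0), (0.0, 1.0), h=0.01, steps=1000, strength=law)
        print(min(areas), max(areas))
```

Up to rounding error, the smallest and largest areas coincide for both force laws, since neither the free flight nor the impulse toward S changes the cross product of position and velocity.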

It is also noteworthy that Newton expressly states the converse of Proposition 1 
(the proof being pretty much the same): 

Proposition 2. Every body that moves in some curved line described in a plane 
and, by a radius drawn to a point, . . . describes areas around that point proportional 
to the times, is urged by a centripetal force tending toward that same point. 

Newton's proof of Kepler's second law is sometimes presented in elementary 
physics books, but then we are told that the rest of Newton's arguments won't 
be given because they require many abstruse properties of conic sections which 
are unfamiliar to us nowadays. But this is rather insincere. While it is quite 
understandable that a geometric proof would use many geometric properties of 
conics, the real mystery is how an hypothesis about inverse square forces is going
to be related to geometric properties of conic sections. Moreover, although we 
won't pursue Newton's argument in its entirety, it turns out that Newton's strategy 
for the proof is extremely clever, far too clever for most of his contemporaries 
(and even for some people today).

To see how Newton relates the forces to the geometry, we need only follow,
with slight modifications, a few steps that he adds a bit later on. In the diagram
for Proposition 1, breaking up the motion into small intervals $h$ of time,
consider the segment $BB'$ from $B$ to the midpoint $B'$ of the diagonal $AC$. This
is half of $BV$, which represents the displacement due to the central force of
magnitude $F$ at $B$, and this distance is just $\tfrac12 F h^2$. So, in the limit,

$$\lim_{h \to 0} \frac{1}{h^2} BB' = \tfrac14 F.$$

Or as Newton phrases it, 

If . . . a body revolves in any orbit about an immobile center and de- 
scribes any just-nascent arc in a minimally small time, and if the sagitta 
of the arc is understood to be drawn so as to bisect the chord and, when 
produced, to pass through the center of forces, the centripetal force in 
the middle of the arc will be as the sagitta directly and as the time 
twice inversely. 




Sagitta is an old-fashioned term (from the Latin for arrow), and Newton's statement
explicitly indicates that he is referring to the segment $XY$ of the line
through $S$ and the midpoint $X$ of the chord $AC$. The fraction $\tfrac14$ doesn't appear
in Newton's statement because the result is phrased as a proportion: the
ratio of the centripetal forces at points $A$ and $A'$ is the same as the ratio of the
limits $\lim XY/h^2$ for arcs starting at $A$ and $A'$, respectively.

Newton next considers a point $P$ at time $t$ on a curved path around the
center $S$, and two points $Q$, $Q'$ on the path, at two nearby times $t - h$ and
$t + h$. Then the force at $P$ is proportional to

$$\lim_{h \to 0} \frac{PX}{h^2}.$$

But, by Proposition 1, $h$ is proportional to the area of the (curved) triangle $SPQ$,
and thus, in the limit, to $QT \times SP$, where $QT$ is perpendicular to $SP$. Thus,
finally, the force at $P$ is proportional to

$$\lim_{h \to 0} \frac{PX}{(SP)^2 \times (QT)^2}.$$

Newton, however, actually presents a figure that has a tangent line drawn at
$P$, and the line $QR$ drawn parallel to $SP$, with the assertion that the force at $P$
is proportional to

$$\lim_{h \to 0} \frac{QR}{(SP)^2 \times (QT)^2}. \tag{F}$$

Of course, $QR$ is not actually equal to $PX$, but it is apparently obvious to
Newton that it is equal to second order so that the limit still holds (something
that I have not had the courage to try to confirm).

And now Newton is all prepared to show that the orbit of an object moving 
under an inverse square force is a conic section. Newton begins in a way that 
might seem strange to us, by proving a partial converse of this assertion:

Let a body revolve in an ellipse; it is required to find the law of the centripetal force
tending toward a focus of the ellipse.

In other words, given a path $\mathbf{c}$ lying along an ellipse, if $\mathbf{c}''$ always points towards
one focus of the ellipse, Newton is going to show that

$$|\mathbf{c}''(t)| = \frac{k}{d(t)^2} \quad \text{for some constant } k,$$

where $d(t)$ is the distance from $\mathbf{c}(t)$ to the focus.

The relation (F) is the key to all this. In fact, in view of (F), our assertion is
equivalent to saying that for an ellipse we have

$$\lim_{h \to 0} \frac{QR}{(QT)^2} \quad \text{is a constant.}$$

This limit has nothing to do with forces, and is completely determined by the
shape of the ellipse. It could even be computed by a double application of
L'Hôpital's Rule: If $F(h)$ denotes $QR$ and $G(h)$ denotes $(QT)^2$, then we have
$\lim F'(h) = \lim G'(h) = 0$, and

$$\lim_{h \to 0} \frac{QR}{(QT)^2} = \frac{F''(0)}{G''(0)};$$

when the quite unpleasant calculation is carried through, it turns out that
$F''(0)/G''(0)$ is independent of the point $P$, and in fact $= a/2b^2$ for the ellipse
shown below (with semi-major axis $a$ and semi-minor axis $b$); the reciprocal, $2b^2/a$,
is the length of the classical latus rectum
of the ellipse, the segment cut off by the ellipse on the vertical line through one
of the foci.




Newton proves exactly this result geometrically, and the proof is indeed long, 
complicated, and depends on numerous results about the ellipse. For a complete 
exposition of this proof see [N-C-W], pp. 325-330.
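
Short of carrying out either the geometric proof or the unpleasant calculation, the value of the limit can at least be checked numerically. The sketch below is only an illustration under stated assumptions: the ellipse is parametrized as (a cos s, b sin s) with the focus S at (-sqrt(a² - b²), 0), Q is taken a small parameter step `eps` away from P, and the function name `qr_over_qt2` is made up for this example.

```python
import math

def qr_over_qt2(a, b, theta, eps):
    """QR / QT^2 for the ellipse (a cos s, b sin s), focus S = (-sqrt(a^2 - b^2), 0).

    P is the point with parameter theta and Q the point with parameter theta + eps.
    QT is the distance from Q to the line SP; QR is the distance from Q to the
    tangent line at P, measured parallel to SP.
    """
    f = math.sqrt(a * a - b * b)
    sx, sy = -f, 0.0
    px, py = a * math.cos(theta), b * math.sin(theta)
    qx, qy = a * math.cos(theta + eps), b * math.sin(theta + eps)
    tx, ty = -a * math.sin(theta), b * math.cos(theta)     # tangent direction at P
    dist = math.hypot(px - sx, py - sy)
    dx, dy = (px - sx) / dist, (py - sy) / dist            # unit vector along SP
    ex, ey = qx - px, qy - py                              # vector from P to Q
    qt = abs(ex * dy - ey * dx)                            # distance from Q to line SP
    qr = abs((ex * ty - ey * tx) / (dx * ty - dy * tx))    # Q + s*(dx, dy) lies on the tangent
    return qr / qt**2

if __name__ == "__main__":
    a, b = 2.0, 1.0
    for theta in (0.3, 1.7, 4.0):
        print(theta, qr_over_qt2(a, b, theta, 1e-5), a / (2 * b * b))
```

For each choice of P the computed ratio is close to a/(2b²), in line with the claim that the limit does not depend on P.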

Newton then gives a similar proof for a body moving on a hyperbola, and 
finally a proof for a body moving on a parabola. 

And immediately after this, stated as a corollary, comes the result we were
anticipating:

Corollary 1. From the last three propositions it follows that if any
body P departs from the place P along any straight line PR with any
velocity whatever and is at the same time acted upon by a centripetal
force that is inversely proportional to the square of the distance of
places from the center, this body will move in some one of the conics
having a focus in the center of forces; and conversely.

In other words, Newton is saying that this Corollary, which constitutes the 
converse of the previous three propositions, automatically follows from them! 
In the first edition of the Principia, this is all that Newton wrote, again failing to 
comprehend that not everyone was as bright as he was. And people who should
have known better, like Bernoulli, actually seemed to think that Newton had 
made some sort of logical error, and even today there are people, who should 
know better, who have suggested that perhaps Newton didn't quite understand 
the difference between a proposition and its converse! 

So in the second edition, Newton added a bit more, and in the third yet 
another bit. 



Corollary 1. From the last three propositions it follows that if any 
body P departs from the place P along any straight line PR with any 
velocity whatever and is at the same time acted upon by a centripetal 
force that is inversely proportional to the square of the distance of 
places from the center, this body will move in some one of the conics
having a focus in the center of forces; and conversely. For if the focus 
and the point of contact and the position of the tangent are given, a 
conic can be described that will have a given curvature at that point. 
But the curvature is given from the given centripetal force and velocity 
of the body; and two different orbits touching each other cannot be 
described with the same centripetal force and the same velocity. 

Probably Newton should have added even a bit more, because there are still
people who can't see the argument, which we'll give explicitly. For simplicity
we simply work in $\mathbb{R}^2$, and we choose the origin $O$ as the point toward which
the force is directed. Given a point $P$, and a tangent vector $\mathbf{v}$ at $P$, we want to
find a curve $\mathbf{c} = (c_1, c_2)$ with $\mathbf{c}(0) = P$ and $\mathbf{c}'(0) = \mathbf{v}$ satisfying

$$\mathbf{c}''(t) = -\frac{k}{|\mathbf{c}(t)|^2} \cdot \frac{\mathbf{c}(t)}{|\mathbf{c}(t)|}, \tag{*}$$

where $k$ is a given constant (the factor $-\mathbf{c}(t)/|\mathbf{c}(t)|$ is just a unit vector pointing
from $\mathbf{c}(t)$ to the origin).

Since we know $\mathbf{c}'(0)$, and (*) gives us $\mathbf{c}''(0)$, we know what the curvature $\kappa$
of $\mathbf{c}$ at $P$ should be, since this is given by

$$\kappa = \frac{c_1'(0)\,c_2''(0) - c_2'(0)\,c_1''(0)}{\bigl(c_1'(0)^2 + c_2'(0)^2\bigr)^{3/2}}, \tag{**}$$

a formula known to Newton and his contemporaries, though they probably
defined curvature in terms of the osculating circle.
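
To make this step concrete, here is a minimal sketch that computes the required curvature from the initial data, using the differential equation to get the initial acceleration and then the curvature formula above. The function name `initial_curvature` and the sample values are illustrative assumptions.

```python
def initial_curvature(p, v, k):
    """Curvature at t = 0 of a solution of c'' = -(k/|c|^2) c/|c| with c(0) = p, c'(0) = v."""
    r = (p[0] ** 2 + p[1] ** 2) ** 0.5
    acc = (-k * p[0] / r**3, -k * p[1] / r**3)    # c''(0), from the differential equation
    num = v[0] * acc[1] - v[1] * acc[0]           # c1'(0) c2''(0) - c2'(0) c1''(0)
    return num / (v[0] ** 2 + v[1] ** 2) ** 1.5   # the curvature formula

if __name__ == "__main__":
    # Starting at distance 1 with unit speed perpendicular to the radius and k = 1.
    print(initial_curvature((1.0, 0.0), (0.0, 1.0), 1.0))
```

In the sample case the result is 1, the curvature of the unit circle, which is consistent with a circular orbit being one of the possible conics for these initial conditions.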

Now consider a conic section $K$ having $O$ as a focus, which passes through $P$,
and is tangent to $\mathbf{v}$ at $P$, and whose curvature at $P$ is this $\kappa$, assuming for the
moment that such a conic section exists. Consider a curve $\gamma$ with $\gamma(0) = P$,
and which traverses $K$ in such a way that the area cut out by radii from $O$
is proportional to the time (area = constant · $t$). Such curves are determined up to a multiplicative
change of parameter; by choosing the appropriate multiplicative constant, we
can arrange for $\gamma'(0) = \mathbf{v}$. According to our converse Proposition 2, we have

$$\gamma''(t) = -\frac{\bar{k}}{|\gamma(t)|^2} \cdot \frac{\gamma(t)}{|\gamma(t)|}$$

for some $\bar{k}$. But we must have $\bar{k} = k$, since we chose $\gamma$ so that its curvature at
$\gamma(0) = P$ would be the $\kappa$ given by (**).

Thus, $\gamma$ is a solution of our differential equation (*), and by uniqueness (which
of course Newton and all his contemporaries implicitly assumed) it is the only 
possible solution. 

(The only slight lacuna in this argument is the existence of a conic section with 
the required curvature, and Newton gives a geometric solution to this problem 
a little later on.) 



Moral: It's dangerous to be a lot smarter than everyone else.



Of course, Newton's argument might strike us as a little weird, starting as it 
does with the converses of the result we want, but a geometric proof almost has 
to be of this nature: it's a lot easier to start with a geometric object, an ellipse, 
or hyperbola, or parabola, and deduce a formula for forces, than it would be 
to start with the formula for forces and somehow conjure up these geometric 
arguments.

I should also add that Bernoulli was finally reduced to admitting the correct-
ness of Newton's argument (see [N-H-T], Volume 7, pp. 77-79), defending his 
own analytic proof as being more direct. But, in fact, Newton also has an ana- 
lytic proof later on in the Principia, as Proposition 41, although it is presented 
completely geometrically, with a terrifyingly complex figure. You can find this 
stated on page 529 of [N-C-W] and starting on page 334 of this book's pre- 
liminary guide to reading the Principia there is a complete "translation" into 
modern calculus. See also [Gu] for a thorough treatment of the disputes that 
arose. 

Although relation (F) was so crucial for analyzing inverse square forces, New- 
ton did not immediately apply it to that task, but first showed other ways in 
which it could be used, which are also rather interesting. The analysis is purely 
geometric, and for the remainder of this lecture you may think that you're back 
in geometry class again, and a really hard one at that (Newton and his contem- 
poraries were really good at geometry!). 

First Newton asks what central force at a point $S$ will cause a body to move
on the circumference of a circle when $S$ is not the center of the circle. Newton
considers a point $P$ on the circle, with a nearby point $Q$. Then (a) we draw the
line from $P$ to $S$, intersecting the circle at $V$. Next (b) we draw the diameter
$VA$, and then (c) we draw the line $TQ$ perpendicular to $SP$, and extend it until
it meets the tangent line to the circle at $P$ in the point $Z$. Finally, (d) we draw the
line $LR$ parallel to $VP$ intersecting the circle at $L$ and the tangent line at $R$.

In (c) we also draw the radius to $P$, which is perpendicular to $ZP$, and conclude
that $\angle TZP = \alpha$, so that the right triangle $ZTP$ is similar to the right
triangle $VPA$.

We also need to recall the elementary geometry theorem that for two secant
lines through a point, we have the relation $BC \cdot BD = BE \cdot BF$, and the obvious
consequence when one of the secant lines is actually a tangent line ($E = F$).
Now, since $LR$ is parallel to $VP$ in (d), we have

$$\frac{RP}{QT} = \frac{ZP}{ZT} = \frac{AV}{PV} \quad \text{by similar triangles.}$$

Squaring, and applying the above mentioned geometric theorem to the secant
$RL$ and the tangent $RP$, we then have

$$\frac{QR \cdot RL}{(QT)^2} = \frac{(AV)^2}{(PV)^2},$$

or

$$\frac{QR}{(QT)^2} = \frac{1}{RL} \cdot \frac{(AV)^2}{(PV)^2},$$

and thus

$$\frac{QR}{(SP)^2 (QT)^2} = \frac{1}{RL} \cdot \frac{(AV)^2}{(SP)^2 (PV)^2}.$$

The left side is the fraction that appears in formula (F), and $RL \to PV$ as
$Q \to P$, while $AV$ is a constant, so the force must be inversely proportional to
$(SP)^2 \cdot (PV)^3$.

Although this result appears in the first edition of the Principia, the following
three remarkable corollaries first appeared in the second edition:

(a) If a particle moves in a circle under a central force directed to a point V on
the circle, then the force varies inversely as the fifth power of the distance. (An
inverse fifth power law is one of the select few, together with the inverse square
and inverse cube laws, for which we can give analytic solutions.)

(b) More generally, suppose that a particle moves on a circle under two different
central forces, one with center at $R$ and one with center at $S$. Let $SG$ be drawn
parallel to $RP$, intersecting the tangent line to the circle at $P$ in the point $G$.
Then the ratio of the force directed toward $S$ to the force directed toward $R$ is

$$\frac{(RP)^2 \cdot (PT)^3}{(SP)^2 \cdot (PV)^3} = \frac{SP \cdot (RP)^2}{(SP)^3 \cdot (PV)^3 / (PT)^3} = \frac{(RP)^2 \cdot SP}{(SG)^3}$$

(where $T$ is the second point in which the line $PR$ meets the circle, just as $V$ is for the line $PS$).

(c) This same result holds for an arbitrary orbit under central forces directed
toward $R$ and $S$: if the two forces result in the same orbit, then the ratio of the
force directed toward $S$ to the force directed toward $R$ is

$$\frac{(RP)^2 \cdot SP}{(SG)^3}. \tag{R}$$

For we just have to consider the osculating circle to the orbit at $P$.

One other situation that Newton analyzed before tackling elliptical orbits 
under a force directed toward the focus was the question of an elliptical orbit 
under a force directed toward the origin. His treatment of that problem was almost 
as complicated as his later treatment of a force directed toward the focus, but 
it's trivial for us to give an immediate solution: 

We simply note that a particle moving along the orbit

$$\mathbf{c}(t) = (a \cos \alpha t,\; b \sin \alpha t)$$

has $\mathbf{c}''(t) = -\alpha^2 \mathbf{c}(t)$, so that it is a possible motion under a central force varying
directly as the distance to the origin, and it is then easy to conclude that the same
holds for any ellipse.

This result also appeared in the first edition, perhaps just as a warm-up for
tackling the case of a force directed toward the focus, but in the second edition
Newton combined this result with the result about the ratio (R) to give an
alternative treatment of the focus case.
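
The claim that the elliptical orbit centered at the origin has acceleration proportional to position, with factor minus the square of the angular rate, can be checked with a finite-difference approximation. This is only a sketch; the helper names `second_derivative` and `ellipse_orbit`, the step size, and the sample parameters are assumptions made for illustration.

```python
import math

def second_derivative(curve, t, h=1e-4):
    """Central-difference approximation to the second derivative of a plane curve at t."""
    (x0, y0), (x1, y1), (x2, y2) = curve(t - h), curve(t), curve(t + h)
    return ((x0 - 2 * x1 + x2) / h**2, (y0 - 2 * y1 + y2) / h**2)

def ellipse_orbit(a, b, alpha):
    """The orbit c(t) = (a cos(alpha t), b sin(alpha t))."""
    return lambda t: (a * math.cos(alpha * t), b * math.sin(alpha * t))

if __name__ == "__main__":
    a, b, alpha, t = 2.0, 1.0, 3.0, 0.7
    c = ellipse_orbit(a, b, alpha)
    ax, ay = second_derivative(c, t)
    x, y = c(t)
    print((ax, ay), (-alpha**2 * x, -alpha**2 * y))
```

The two printed pairs agree to within the accuracy of the finite-difference approximation.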

We will need another property of the ellipse, but it is one that we can easily
derive from familiar ones. Recall that the ellipse with major axis $2a$ and foci $F_1$
and $F_2$ is defined by the property that $F_1P + PF_2 = 2a$ for all points $P$. It also
has the "focal point" property that a light ray starting from $F_1$ and reflected at $P$
passes through $F_2$, i.e., that the two angles indicated in (a) are equal.

In diagram (b) below we have drawn lines parallel to the tangent line at $P$
through $O$ and $F_2$. The two angles indicated by thick arcs are equal, so the
two thick segments have the same length $\beta$. And the two segments with lengths
indicated as $\alpha$ are equal because $F_1O = OF_2$. Thus

$$2\alpha + 2\beta = 2a.$$

Finally (c), moving $XP$ over to $OG$ we see that a line through the origin $O$
parallel to the line $F_1P$ always intersects the tangent line through $P$ at a point $G$
with $OG = a$.

From this and the result (R) for the ratio, it is now easy to conclude that the 
result for a force directed toward a focus of an ellipse follows from the very 
elementary result for a force directed toward the center. 

An analytic reworking of this whole circle of results can be found in Appen- 
dix 1 of [A]. 
SYSTEMS OF PARTICLES;
CONSERVATION LAWS 

This lecture is fairly straightforward, and serves mainly as a prelude for the 
next, when we finally get to consider rigid bodies. 

We will be considering a "system of particles". This means that we have
certain particles $c_1, \dots, c_K : \mathbb{R} \to \mathbb{R}^3$, with positive masses
$m_1, \dots, m_K \in \mathbb{R}$. We also have certain forces, functions

$$F_i^e : \mathbb{R} \to \mathbb{R}^3,$$

the "external" forces on the particles (for example the force of gravity for a
system of particles near the earth), as well as "internal forces"

$$F_{ij} = -F_{ji} : \mathbb{R} \to \mathbb{R}^3,$$

where $F_{ij}$ represents the force exerted by $c_j$ on $c_i$. In accordance with the
third law we are assuming that

$$F_{ij} = -F_{ji}.$$

In addition, we will soon need the "strong version" of the third law, which says
that $F_{ij}$ points from $c_i$ to $c_j$, i.e., $F_{ij}$ is a multiple of $c_i - c_j$. This
stronger version of the third law is never stated by Newton, although one might
assume that he considered this implicit, on symmetry grounds.
Finally, if we set

$$F_i = F_i^e + \sum_j F_{ij},$$

so that $F_i$ is the total force on the particle $c_i$ at time $t$, then we assume, in
accordance with the second law, that

$$F_i = m_i \cdot c_i''.$$

The short demonstration in Lecture 1 of conservation of momentum involved
just 2 particles and only the internal forces between them, but we can easily state
a generalization for an arbitrary system of particles with external forces. If we
set $F = \sum_i F_i^e$, the total external force, then

$$F = \Bigl(\sum_i m_i v_i\Bigr)'.$$

Note that here we have to regard the various $F_i^e$ simply as elements of $\mathbb{R}^3$,
rather than as tangent vectors at different points of $\mathbb{R}^3$, and similarly for
the $v_i$.

The proof can be left as an easy exercise, but is included in a variant of the
formula that is made to look a lot prettier by introducing the center of mass
of the system $\{c_i\}$, which represents the "average" position of the particles $c_i$
weighted according to their masses:

$$C = \frac{\sum_i m_i c_i}{\sum_i m_i}.$$

More precisely we should define the center of mass as the particle consisting of
the path $C$ with the mass $M = \sum_i m_i$.

If all $F_i^e = 0$, so that $\sum_i m_i v_i$ is constant, then

$$C'' = \frac{1}{M}\sum_i m_i c_i'' = \frac{1}{M}\sum_i m_i v_i' = \frac{1}{M}\Bigl(\sum_i m_i v_i\Bigr)',$$

so that we also have $C'' = 0$. Thus, $C'$ is constant; in other words, the center
of mass moves with uniform velocity. And more generally, if

$$F = \sum_i F_i^e$$

is the total external force, then

$$F = M \cdot C'',$$

so that the center of mass particle simply moves as if it were acted upon by the
total force $F$.

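For the proof we simply note that

$$\begin{aligned}
M \cdot C'' &= \sum_i m_i c_i'' \\
&= \sum_i F_i \\
&= \sum_i F_i^e + \sum_i \sum_j F_{ij} \\
&= \sum_i F_i^e,
\end{aligned}$$

since the double sum $\sum_{i,j} F_{ij}$ vanishes.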
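
The cancellation of the internal forces can also be watched numerically. The sketch below is an illustration under stated assumptions, not anything from the lectures: it draws random masses, random external forces, and random internal forces satisfying only $F_{ij} = -F_{ji}$, computes each acceleration from the second law, and compares $M \cdot C''$ with the total external force. All names (`check_center_of_mass`, `internal`, and so on) are hypothetical.

```python
import random

def add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def scale(s, u):
    return tuple(s * x for x in u)

def random_vector(rng):
    return tuple(rng.uniform(-1.0, 1.0) for _ in range(3))

def check_center_of_mass(n=5, seed=0):
    """Return the largest component of M*C'' - (total external force)."""
    rng = random.Random(seed)
    masses = [rng.uniform(0.5, 2.0) for _ in range(n)]
    external = [random_vector(rng) for _ in range(n)]
    # internal[i][j] is the force on particle i exerted by particle j,
    # built so that internal[j][i] = -internal[i][j] (third law only).
    internal = [[(0.0, 0.0, 0.0)] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            f = random_vector(rng)
            internal[i][j] = f
            internal[j][i] = scale(-1.0, f)
    # Second law: m_i c_i'' = F_i^e + sum_j F_ij.
    m_times_c2 = (0.0, 0.0, 0.0)
    for i in range(n):
        total = external[i]
        for j in range(n):
            total = add(total, internal[i][j])
        acceleration = scale(1.0 / masses[i], total)
        m_times_c2 = add(m_times_c2, scale(masses[i], acceleration))
    total_external = (0.0, 0.0, 0.0)
    for f in external:
        total_external = add(total_external, f)
    # M*C'' equals sum_i m_i c_i'', which is what m_times_c2 holds.
    return max(abs(x - y) for x, y in zip(m_times_c2, total_external))
```

Notice that only the ordinary third law is used here; the strong version plays no role until angular momentum enters.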

Of course, the "particle" C might not be one of the particles in our sys- 
tem. Nevertheless, this result is seldom regarded as particularly "theoretical" — 
instead it allows us to get a very simple picture of very complex phenomena. 
For example, if we throw a twirling baton into the air, it will execute a rather 
complicated motion, but its center of mass moves in a parabola, just like a point 
mass. A striking illustration may be obtained with a time-exposure photograph 
taken when a baton is tossed in the air, with lights at the ends and the center of 
mass, giving a picture like this: 
By the way, we usually think of a rod as a "rigid body", and that might seem
to make the above equation even more impressive: in a real rod, with all sorts 
of complicated intermolecular forces, which make it approximately "rigid", but 
not truly so, it is still true that the center of mass moves according to a simple law. 
But that is a somewhat misleading way of construing the result, since rigidity of 
the rod is required in order to identify its center of mass with a particular point 
of the rod, on which we can attach one of the lights. 

Newton has a statement of this result as Corollary 4 in his "Axioms" section, 
but he uses the older term "center of gravity" for "center of mass". 

The common center of gravity of two or more bodies does not change its state whether of 
motion or of rest as a result of the actions of the bodies upon one another; and therefore 
the common center of gravity of all bodies acting upon one another (excluding external 
actions and impediments) either is at rest or moves uniformly straight forward. 

Notice that Newton explicitly disallows non-zero external forces, thereby fore- 
going the real interest of the corollary. Moreover, his proof is unbelievably com- 
plicated, occupying nearly two pages, referring in addition to a later Lemma 
that occurs in a completely different context and involves a strangely compli- 
cated figure. 

I'm not sure what this all means, except perhaps that simple vector calculus 
is a lot more convenient than geometry. That is certainly illustrated by a second 
conservation law, which occurs only implicitly in the Principia. 

The cross-product $\times$, which was used at the beginning of the previous
lecture basically as a convenient abbreviation for manipulations with determinants,
turns out to have a fundamental role in mechanics. Of course, students always 
wonder why there should be a vector product in 3 dimensions, but not in other 
dimensions, and every professor seems to have their own answer. I'm going to 
give my answer now, and even if you don't like this particular answer, at least it 
will introduce an important fact. 
For any vector $w \in \mathbb{R}^3$, consider the one-parameter family of maps
$B(t) : \mathbb{R}^3 \to \mathbb{R}^3$, where $B(t)$ is a clockwise rotation through an angle of
$t|w|$ radians around the axis through $w$ [choosing an orientation $(v_1, v_2)$ of
the plane perpendicular to $w$ so that $(v_1, v_2, w)$ is the usual orientation of
$\mathbb{R}^3$]. Now consider the vector field generated by this one-parameter family.
In other words, for each $p \in \mathbb{R}^3$ consider the curve $B_p(t) = B(t)(p)$, and
then look at the tangent vector $X_p$ of this curve at 0.



To compute $X_p$ geometrically we note that $X_p$ is clearly perpendicular to
both $p$ and $w$. Its length is also easy to determine. When $p$ happens to lie
in the plane perpendicular to $w$, as in (a), the point $p$ rotates in a circle of
radius $|p|$, and $X_p$ has length $|p| \cdot |w|$. More generally (b), the point $p$
rotates in a circle of radius $|p| \cdot \sin\theta$, where $\theta$ is the angle between $w$
and $p$, so that $X_p$ has length $|p| \cdot |w| \cdot \sin\theta$. Thus, $X_p$ is just the
geometrically defined cross-product $p \times w$.

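For an analytic determination of $X_p = B_p'(0)$, we note that since the $B(t)$
are all orthogonal, and $B(0) = I$, the derivative $B'(0)$ is skew-adjoint, with a
skew-symmetric matrix $M$, which we will write in the form

$$M = \begin{pmatrix} 0 & -\omega_3 & \omega_2 \\ \omega_3 & 0 & -\omega_1 \\ -\omega_2 & \omega_1 & 0 \end{pmatrix}.$$

Then the vector $X_p$ is the 3-tuple

$$X_p = (p_1, p_2, p_3) \cdot M = (p_2\omega_3 - p_3\omega_2,\; p_3\omega_1 - p_1\omega_3,\; p_1\omega_2 - p_2\omega_1).$$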
Setting $\omega = (\omega_1, \omega_2, \omega_3)$, this is just the cross-product $p \times \omega$.
Moreover, $\omega$ is easy to identify, because

$$B_w(t) = w \text{ for all } t \implies 0 = X_w = w \times \omega,$$

which means that $\omega$ is a multiple of $w$, and it is easy to check, by considering
some specially chosen vector, that in fact $\omega = w$. Thus, one might say that
the cross-product $\times$ is special to $\mathbb{R}^3$ because $n = 3$ is the only dimension
where $O(n)$ has dimension $n$. In any case, what we have shown is that

The vector fields in $\mathbb{R}^3$ generated by rotations about an axis are of the form
$p \mapsto p \times w$ for $w \in \mathbb{R}^3$.
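
Here is a small numerical sanity check of this statement, offered only as a sketch. It assumes that "clockwise rotation through $t|w|$ radians" can be realized by Rodrigues' rotation formula with the negative angle $-t|w|$ about the unit vector $w/|w|$ (the convention that makes the tangent vector come out as $p \times w$ rather than $w \times p$); the function names and the finite-difference step are likewise assumptions.

```python
import math

def cross(u, v):
    """Cross product u x v of two 3-vectors."""
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def dot(u, v):
    return sum(x * y for x, y in zip(u, v))

def rotate_clockwise(p, w, t):
    """B(t)(p): Rodrigues' formula with angle -t|w| about the axis w/|w|."""
    norm = math.sqrt(dot(w, w))
    k = tuple(x / norm for x in w)
    theta = -t * norm
    k_cross_p = cross(k, p)
    k_dot_p = dot(k, p)
    return tuple(
        p[i] * math.cos(theta)
        + k_cross_p[i] * math.sin(theta)
        + k[i] * k_dot_p * (1.0 - math.cos(theta))
        for i in range(3)
    )

def tangent_at_zero(p, w, h=1e-6):
    """Central-difference approximation to X_p, the derivative of B(t)(p) at t = 0."""
    plus = rotate_clockwise(p, w, h)
    minus = rotate_clockwise(p, w, -h)
    return tuple((a - b) / (2.0 * h) for a, b in zip(plus, minus))
```

For any sample vectors, `tangent_at_zero(p, w)` should agree with `cross(p, w)` up to the finite-difference error, and `rotate_clockwise(w, w, t)` should return `w` itself, which is the fact used above to identify $\omega$.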

For a particle $c$ with velocity vector $v$ we can consider the function $c \times v$
from $\mathbb{R}$ to $\mathbb{R}^3$. This function is called the angular velocity of the particle,
and if the mass of the particle is $m$, the cross-product $L = c \times mv$ is called its
angular momentum. It would be more precise to call these quantities the angular
velocity and angular momentum with respect to the origin 0. For any other
point $P$, the angular momentum with respect to $P$ is the cross-product

$$L_P = (c - P) \times mv.$$

For a system of particles $(c_1, \dots, c_K)$ we define the angular momentum $L$ of
the system with respect to 0 as

$$L = \sum_{i=1}^{K} c_i \times m_i v_i;$$

here it is naturally necessary to consider all $c_i \times m_i v_i$ as vectors at a single
point, rather than as tangent vectors at different points. And, more generally,
we define the angular momentum $L_P$ with respect to $P$ as

$$L_P = \sum_{i=1}^{K} (c_i - P) \times m_i v_i.$$

In particular, suppose we take $P$ to be the center of mass $C$ of the system
(this means that we will be considering the angular momentum with respect
to different points at different times). Letting $M = \sum_i m_i$, the "mass" of the
particle $C$, we then have

$$\begin{aligned}
\sum_i m_i c_i \times v_i &= \sum_i m_i (c_i - C) \times v_i + \sum_i m_i C \times v_i \\
&= L_C + \Bigl[C \times \Bigl(\sum_i m_i v_i\Bigr)\Bigr] \\
&= L_C + [C \times MC'],
\end{aligned}$$

so that we can write

$$L = L_C + (C \times MC').$$

The vector $L_C$, the angular momentum with respect to the center of mass, is
also called the "rotational angular momentum", so our equation says that the
total angular momentum $L$ is the sum of the rotational angular momentum $L_C$
and the angular momentum of the center of mass with respect to 0.
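
As a quick illustration, the decomposition can be tested on random data. This is a minimal sketch under the assumption that positions and velocities at a single instant are simply given as lists of 3-tuples; the helper names are hypothetical and nothing here is taken from the lectures.

```python
import random

def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def sub(u, v):
    return tuple(x - y for x, y in zip(u, v))

def scale(s, u):
    return tuple(s * x for x in u)

def weighted_mean(masses, vectors):
    """(sum_i m_i x_i) / (sum_i m_i), used for both C and C'."""
    total = (0.0, 0.0, 0.0)
    for m, x in zip(masses, vectors):
        total = add(total, scale(m, x))
    return scale(1.0 / sum(masses), total)

def angular_momentum(masses, positions, velocities, about=(0.0, 0.0, 0.0)):
    """L_P = sum_i (c_i - P) x m_i v_i, with P = about."""
    total = (0.0, 0.0, 0.0)
    for m, c, v in zip(masses, positions, velocities):
        total = add(total, cross(sub(c, about), scale(m, v)))
    return total

def split_angular_momentum(n=6, seed=1):
    """Return (L, L_C + C x M C') for a random instantaneous configuration."""
    rng = random.Random(seed)
    masses = [rng.uniform(0.5, 2.0) for _ in range(n)]
    positions = [tuple(rng.uniform(-1, 1) for _ in range(3)) for _ in range(n)]
    velocities = [tuple(rng.uniform(-1, 1) for _ in range(3)) for _ in range(n)]
    C = weighted_mean(masses, positions)
    C_prime = weighted_mean(masses, velocities)
    L = angular_momentum(masses, positions, velocities)
    L_C = angular_momentum(masses, positions, velocities, about=C)
    orbital = cross(C, scale(sum(masses), C_prime))
    return L, add(L_C, orbital)
```

The two returned vectors should agree up to rounding error, whatever the seed.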

If instead of a momentum vector we consider an arbitrary force $F$ at a point $c$,
the cross-product

$$\tau = c \times F$$

is called the torque of the force with respect to 0, while $\tau_P = (c - P) \times F$ is
the torque with respect to $P$. (Although I have used the physicists' $L$ for angular
momentum, I couldn't bring myself to use the standard $N$ for torque.)

Similarly, we define the torque of a system of forces on a system of particles; 
here it is again necessary to consider the individual torques as being vectors 
at one point, even though we naturally think of the forces as being applied at 
different points. 

All these definitions finally enable us to state: 

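If our system satisfies the strong form of the third law, then the total torque
is the derivative of the total angular momentum,

$$\tau = L'.$$

For the proof we have

$$\begin{aligned}
L' = \Bigl(\sum_i c_i \times m_i v_i\Bigr)' &= \sum_i c_i' \times m_i v_i + \sum_i c_i \times m_i v_i' \\
&= 0 + \sum_i c_i \times F_i \\
&= \sum_i c_i \times F_i^e + \sum_i \sum_j c_i \times F_{ij}.
\end{aligned}$$

The strong form of the third law allows us to write

$$F_{ij} = \lambda_{ij}(c_i - c_j),$$

with $\lambda_{ij} = \lambda_{ji}$ since $F_{ij} = -F_{ji}$, so we have

$$\sum_i \sum_j c_i \times F_{ij} = \sum_i \sum_j \lambda_{ij}\,(c_i \times c_i - c_i \times c_j) = -\sum_{i,j} \lambda_{ij}\, c_i \times c_j.$$

This double sum vanishes, since $\lambda_{ij} = \lambda_{ji}$, while $c_i \times c_j = -c_j \times c_i$.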
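
To see why the strong form of the third law is really needed, it may help to compute the internal torque $\sum_{i,j} c_i \times F_{ij}$ in two cases. The sketch below is an illustration only, with hypothetical names: `central_forces` builds forces of the form $\lambda_{ij}(c_i - c_j)$ with symmetric $\lambda_{ij}$, while the second case (in the usage note) uses a pair of equal and opposite forces that are not directed along the line joining the particles.

```python
def cross(u, v):
    return (u[1] * v[2] - u[2] * v[1],
            u[2] * v[0] - u[0] * v[2],
            u[0] * v[1] - u[1] * v[0])

def add(u, v):
    return tuple(x + y for x, y in zip(u, v))

def sub(u, v):
    return tuple(x - y for x, y in zip(u, v))

def scale(s, u):
    return tuple(s * x for x in u)

def internal_torque(positions, forces):
    """Sum of c_i x F_ij over all pairs i != j; forces[i][j] acts on i, exerted by j."""
    total = (0.0, 0.0, 0.0)
    n = len(positions)
    for i in range(n):
        for j in range(n):
            if i != j:
                total = add(total, cross(positions[i], forces[i][j]))
    return total

def central_forces(positions, lam):
    """F_ij = lam[i][j] * (c_i - c_j), with lam symmetric (strong third law)."""
    n = len(positions)
    return [[scale(lam[i][j], sub(positions[i], positions[j])) for j in range(n)]
            for i in range(n)]
```

With two particles at (0, 0, 0) and (1, 0, 0) and the forces (0, 1, 0) on the first and (0, -1, 0) on the second, the ordinary third law holds but the internal torque is (0, 0, -1), so angular momentum would not be conserved. With forces produced by `central_forces`, the internal torque vanishes.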
An easy calculation then shows that, more generally, for any point $P$,

$$\tau_P = L_P'.$$

In particular, of course, if the torque is 0, then angular momentum is
conserved. This certainly happens in the special case of a single particle
moving under a central force, where the external force $F$ is a multiple of $c$, so
that $\tau = c \times F = 0$. This was noted by Newton as the first Corollary of his
Proposition 1 in the previous lecture.

Corollary 1. In nonresisting spaces, the velocity of a body attracted 
to an immobile center is inversely as the perpendicular dropped from 
that center to the straight line which is tangent to the orbit. 

Even the somewhat more general rule that angular momentum is conserved
in the absence of external forces was not stated until quite some time
afterwards, and this law was known for a long time simply as "the law of areas", or
Flächensatz in German.

A standard elementary illustration of the law of conservation of angular mo- 
mentum is provided by a person seated on a rotating stool with arms extended 
out holding weights, and then increasing the speed of the spin, often quite dra- 




matically, simply by pulling the weights inward. Similarly, ice-skaters speed up 
their turns by pulling their arms in; divers, starting their dive with a small an- 
gular momentum, do rapid somersaults by pulling their arms and knees in; and 
gymnasts do all sorts of tricks. 

By the way, without appealing to conservation of angular momentum we can 
explain the speed-up as a simple consequence of the parallelogram rule for 
forces, or even for velocities: the sum of the velocity v that the weight already 
has and the velocity w that it acquires as a result of the inward pull lies along the 




diagonal of the rectangle spanned by these two, and consequently has a greater 
length. 
In these examples, we merely altered the given non-zero angular momentum,
but something interesting occurs even when we start with angular momentum 0. 
Moving the weights along a circle in one direction contributes a certain amount 



of angular momentum to the system of weights-plus-person, which must be 
countered by an opposite amount of angular momentum in the system, so the 
seated person must rotate in the opposite direction. At the end of the motion, 
when the weights are no longer being rotated, the person will have stopped 
rotating, but will be facing in a different direction; cats use this mechanism to 
land on their paws even when dropped from an upside-down position. 

In this respect, rotation is quite different from linear motion. A system cannot 
change its position using only internal forces, and no external forces. On a 
perfectly frictionless ice surface you can change the direction in which you are 
facing, but you can't move the position of your center of mass. (Of course, you 
can forcefully exhale, providing yourself with rocket propulsion, making use of 
the fact that the air inside your lungs is a part of your system that you aren't 
attached to; or you could just throw your coat away.) 

Conservation of momentum and angular momentum are the two great con- 
servation laws of mechanics. Of course, there's also Conservation of Energy, 
which receives much greater star-billing in modern physics textbooks. But the 
general principle of conservation of energy involves much more than mechan- 
ics. There is still an important conservation of energy principle for mechanics 
itself, involving "conservative" forces, but I will skip over that in these lectures, 
since our main aim is to see how the notion of rigid bodies can be handled, and 
to investigate other developments to which this treatment leads. 




It's finally time to address the problem with which these lectures began: How
do we analyze rigid bodies? Since Newton's laws are basically about "particles",
it is natural for us to think of a rigid body as a large system of particles,
especially since this corresponds to our modern ideas involving atoms. But the 
analysis is hardly straightforward, because rigid bodies are idealizations, in fact 
idealizations that don't seem to have any theoretical counterpart. 

For a system of particles to constitute a rigid body, the distances between 
them would have to remain constant, no matter what forces are applied. In 
special relativity theory this would be impossible in principle: it would mean 
that applying a force at one end of a rigid body would cause the other end of 
the rigid body to move immediately, so that information would be sent from one
end of the body to the other instantaneously.

Even in classical mechanics, there is no way for a system of particles to act as 
a rigid body. The internal forces between two particles of a rigid body must be 0
when they are at their original distance $d$ apart, but become strongly repulsive
if the distance is slightly smaller than d and strongly attractive if the distance 
is slightly larger than d. If two opposing forces are applied to the particles at 
the end of a rigid body, they will initially push them slightly toward each other, 
producing a strong repulsive force, which will not only return the particles to 
their original position, but actually cause them to move slightly further apart; 
this, in turn, will produce large attractive forces, now moving the particles back 
toward their initial separation, and slightly beyond, causing the repulsive forces 
to act again. Thus, we would expect the particles to vibrate around their original 
separation, which is more or less what actually happens in a real-world rigid 
rod made of molecules. 

Nevertheless, it turns out that we can still ask when a system of particles is 
acting in a way that we would expect a rigid body to act. 

The simplest question we can ask is when a system of particles should be 
considered to be in equilibrium under a set of external forces. As the very 
simplest possible example, let's consider a "rigid rod" that consists of just two
points $b_1$ and $b_2$ (representing two molecules, say) at a distance $d$ apart, and
"external" forces $F_i$ acting on $b_i$. These forces might be produced, for example,
by someone exerting equal but opposite pressure on both sides of this rod.



If $F_1 = -F_2$, then we would expect this rigid rod to be in equilibrium under
these forces, and we can justify this expectation by noting that if we consider a
force $F_{21}$ on $b_2$ equal to $-F_2$ and a force $F_{12}$ on $b_1$ equal to $-F_1$, then these
"internal" forces $F_{12}$ and $F_{21}$ do satisfy Newton's third law, and together with
the forces $F_1$ and $F_2$ they leave our rod, consisting of $b_1$ and $b_2$, in equilibrium.

To be sure, this picture becomes quite a bit hazier if we try to imagine how 
these "internal" forces would arise as the forces $F_i$ are applied. This would
lead us right back to the "realistic" picture of rapidly vibrating molecules. And 
it won't help to consider the limiting situation as the constraining forces of the 
molecule are made greater and greater, because this simply causes the molecules 
to vibrate more and more rapidly — although they will stay closer and closer to 
their natural separation, their motions will not approach a limit. 

So instead, we will consider our abstract rigid rod to be in equilibrium 
simply because such forces can be defined, without worrying about the details 
of just how these forces would actually arise in practice, for rods that aren't 
ideally rigid. 

More generally, let us consider a collection of points $b_1, \dots, b_K$, which it will
sometimes be convenient to regard as a single object, $b = (b_1, \dots, b_K)$, as well
as a collection of forces $F = (F_1, \dots, F_K)$, where we regard $F_i$ as acting on $b_i$.
Then we can make the following definition:

The collection of points $b$ is in rigid equilibrium under the forces $F$ if there
exist "internal" forces $F_{ij} = -F_{ji}$ which are multiples of $b_i - b_j$ such that

$$F_i + \sum_j F_{ij} = 0 \quad \text{for all } i.$$

Much more colloquially, of course, we just say that "the rigid body b is in 
equilibrium under the forces F". 



So now we have a perfectly meaningful definition, which doesn't require the
notion of a rigid body directly. There's only one problem with this definition:
it doesn't work. For example, we presumably ought to have equilibrium for the 
rod shown below, where there are equal forces F at the ends of the rod together 




with a force $-2F$ in the middle. But these forces obviously can't be balanced by
forces that are multiples of the vectors $b_i - b_j$. Of course, in practice, the rod
will bend a bit, and in this situation the necessary "internal" forces will exist.




Fortunately, we can stick with our strict theoretical model if we represent the
situation by a slightly more realistic figure, with a few extra "molecules", so that
once again the required internal forces will exist.

[Figure: the rod with points $b_1$, $b_2$, $b_2'$, $b_3$, $b_3'$ and the force $-2F$.]




We will normally presume that our particles do not lie on a straight line, or 
even on a plane, and in realistic situations the number of particles should be 
much greater, although special cases may be useful for illustration. 
It should be pointed out that these internal forces will almost never be unique.
They will be unique if our system consists of just 4 points $b_1, b_2, b_3, b_4$ not on a
plane, but as soon as we have more points there will be many possible choices.

In fact the internal forces aren't unique even for the special case of a "rigid
rod" consisting of particles $b_1, \dots, b_K$ lying on a straight line. Given equal and
opposite forces $F$ and $F'$ on the ends $b_1$ and $b_K$, we could choose just two
forces $F_{1K}$ and $F_{K1}$ between $b_1$ and $b_K$, essentially ignoring all the particles
between them, but it would be more natural to balance $F$ with a force $F_{12}$
exerted on $b_1$ by $b_2$, requiring an equal but opposite force $F_{21}$ on $b_2$, which
would in turn be balanced by a force $F_{23}$ exerted on $b_2$ by $b_3$, $\dots$, leading
finally to a force $F_{K,K-1}$ exerted on $b_K$ by $b_{K-1}$ that balances $F'$.

This corresponds much better to our intuitive idea of a rigid rod: the mol- 
ecules at one end merely influence the nearby molecules, making a tiny sliver 
of the rod rigid; those molecules in turn influence molecules a little bit further 
away, making a little bit more of the rod rigid, etc. 

Now mathematicians know that it's always rather hard to work with non- 
unique data, so one always looks for an alternative way of handling such prob- 
lems. In the case of rigidity this is obtained by considering "rigid motions" of b. 
By this we simply mean a collection of paths $c = (c_1, \dots, c_K)$ with $c_i(0) = b_i$
such that each

$$|c_i(t) - c_j(t)|^2 = \langle c_i(t) - c_j(t),\, c_i(t) - c_j(t) \rangle \quad \text{is constant.}$$

Alternatively, we might think of a rigid motion as a curve $t \mapsto A(t)$ of isometries
of $\mathbb{R}^3$, with $c_i(t) = A(t)(c_i(0)) = A(t)(b_i)$.
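Given such a rigid motion, consider the $K$-tuple of tangent vectors

$$v = (v_1, \dots, v_K) = (c_1'(0), \dots, c_K'(0)) \in (\mathbb{R}^3)^K.$$

Differentiating the equation

$$\langle c_i(t) - c_j(t),\, c_i(t) - c_j(t) \rangle = \text{constant}$$

and evaluating at 0 gives us

(1)   $$\langle v_i - v_j,\, b_i - b_j \rangle = 0.$$

Since the force $F_{ij}$ is a multiple of $b_i - b_j$, this implies that

$$\langle v_i - v_j,\, F_{ij} \rangle = 0.$$

Consequently,

$$\begin{aligned}
\sum_{i,j} \langle v_i, F_{ij} \rangle = \sum_{i,j} \langle v_j, F_{ij} \rangle &= -\sum_{i,j} \langle v_j, F_{ji} \rangle \\
&= -\sum_{i,j} \langle v_i, F_{ij} \rangle \quad \text{(interchanging $i$ and $j$)}
\end{aligned}$$

and thus

$$\sum_{i,j} \langle v_i, F_{ij} \rangle = 0.$$

This in turn means that the external forces $F_k$ in the condition for rigid
equilibrium satisfy

$$\sum_k \langle v_k, F_k \rangle = -\sum_{k,j} \langle v_k, F_{kj} \rangle = 0,$$

or simply

(*)   $$\sum_k \langle v_k, F_k \rangle = 0.$$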

Physicists refer to these $K$-tuples $v = (c_1'(0), \dots, c_K'(0))$ for rigid motions $c$
of $b$ as "virtual infinitesimal displacements" of $b$. The word "infinitesimal"
in this phrase shouldn't surprise us — it's just the standard physicists' way of 
referring to tangent vectors. As for the word "virtual" here, it has about as 
much meaning as it does in the phrase "virtual reality". Basically it refers to 
the fact that although we have obtained equation (*) under the assumption that 
our rigid body is in equilibrium, we have done so by considering tangent vectors 
to "virtual" rigid motions, i.e., motions that our rigid body might have had if it 
weren't in equilibrium. 

This can all be expressed in a more familiar, geometric, way by considering
the "configuration space" of $b$, which is the subset $M \subset (\mathbb{R}^3)^K$ of all points
that can be reached from $b$ at the end of a rigid motion. In other words,

$$M = \{\, (A(b_1), \dots, A(b_K)) : A \text{ an orientation preserving isometry of } \mathbb{R}^3 \,\}.$$

When $b$ is non-planar, $M$ is a 6-dimensional manifold diffeomorphic to the set
of all orientation preserving isometries $A$ of $\mathbb{R}^3$, and thus to $\mathbb{R}^3 \times SO(3)$. With
this picture, a rigid motion of $b$ is simply a curve in $M$, so a virtual infinitesimal
displacement $v$ of $b$ is simply a tangent vector to $M$ at $b$.
We've already found that any such $v$ satisfies the equation

(1)   $$\langle v_i - v_j,\, b_i - b_j \rangle = 0.$$

If we define linear functions $\phi_{ij}$ on $(\mathbb{R}^3)^K$ by

$$\phi_{ij}(v_1, \dots, v_K) = \langle v_i - v_j,\, b_i - b_j \rangle,$$

this says that the tangent space $M_b$ of $M$ at $b$ satisfies

$$M_b \subset \bigcap_{i,j} \ker \phi_{ij}.$$

At this point we want to state a simple, but absolutely crucial, little lemma:
Lemma. If $b$ is non-planar, then

$$M_b = \bigcap_{i,j} \ker \phi_{ij}.$$



Proof. By renumbering, we can assume that $b_1, b_2, b_3, b_4$ are points of $b$ that
do not lie in a plane. There is clearly no loss of generality in assuming that
$b_1 = 0$ [as reflected by the fact that we can replace all $b_i$ by $b_i - b_1$ without
changing (1)]. Thus our assumption on $b_1, b_2, b_3, b_4$ amounts to $b_2, b_3, b_4$ being
linearly independent.

Since we can also replace all $v_i$ by $v_i - v_1$ without changing (1), it follows that

$$\dim\Bigl(\bigcap_{i,j} \ker \phi_{ij}\Bigr) = 3 + \dim\Bigl(\Bigl\{ (0, v_2, \dots, v_K) \in \bigcap_{i,j} \ker \phi_{ij} \Bigr\}\Bigr).$$

Now for $v$ with $v_1 = 0$, a first application of (1) gives

$$\langle v_i, b_i \rangle = \langle v_i - v_1,\, b_i - b_1 \rangle = 0, \quad i = 2, 3, 4,$$

and then a second application gives

$$-\langle v_i, b_j \rangle = \langle v_j, b_i \rangle, \quad i, j = 2, 3, 4.$$






So if $A : \mathbb{R}^3 \to \mathbb{R}^3$ is the linear transformation with

$$v_i = A b_i, \quad i = 2, 3, 4,$$

then $A$ is skew-adjoint. But the dimension of skew-symmetric $3 \times 3$ matrices is 3,
so the dimension of $\bigcap_{i,j} \ker \phi_{ij}$ is at most 6, which is the dimension of $M_b$.
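
The dimension count in the lemma can be checked on a concrete example by exact linear algebra. The following sketch is an illustration only: it writes each $\phi_{ij}$ as a row of length $3K$, computes the rank by Gaussian elimination over the rationals, and returns $3K$ minus the rank. The function names and the sample points (with integer coordinates, so that the arithmetic is exact) are assumptions.

```python
from fractions import Fraction
from itertools import combinations

def constraint_rows(points):
    """One row per pair i < j, representing phi_ij(v) = <v_i - v_j, b_i - b_j>."""
    K = len(points)
    rows = []
    for i, j in combinations(range(K), 2):
        row = [Fraction(0)] * (3 * K)
        for k in range(3):
            d = points[i][k] - points[j][k]
            row[3 * i + k] = Fraction(d)
            row[3 * j + k] = Fraction(-d)
        rows.append(row)
    return rows

def rank(rows):
    """Rank by Gaussian elimination with exact rational arithmetic."""
    rows = [r[:] for r in rows]
    r = 0
    for col in range(len(rows[0])):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            if rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [x - factor * y for x, y in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

def kernel_dimension(points):
    """Dimension of the intersection of the kernels of all phi_ij."""
    return 3 * len(points) - rank(constraint_rows(points))
```

For the five non-planar points (0,0,0), (1,0,0), (0,1,0), (0,0,1), (1,2,3) the kernel has dimension 6, as the lemma asserts. For four points in a plane, such as the corners of a unit square, the dimension is larger than 6, because each point can also be displaced independently in the direction perpendicular to the plane; this is why the non-planar hypothesis is needed.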

By the way, it shouldn't be too surprising that the mechanics of this proof
involved skew-adjoint transformations, since they are the derivatives of orthogonal
ones (as we noted in our discussion of the cross-product); given the
transformation $A$ of the proof, the isometries $e^{tA}$ would produce the given
infinitesimal virtual displacement $v$.

If we use $\langle\ ,\ \rangle$ for the usual inner product on $(\mathbb{R}^3)^K$, then equation (*) can
be written in the simple form

$$\langle v, F \rangle = 0;$$

in other words, $F$ is perpendicular to the tangent space $M_b$. The inner product
of force and distance is generally called work, so this sum is also called the
"(virtual) infinitesimal work" done by the forces $F$ during the (virtual) infinitesimal
displacement $v$. Our little calculation that $\langle v, F \rangle = 0$ if $b$ is in rigid equilibrium
under $F$ is often referred to by physicists as a proof of the "principle of virtual
work". In reality, however, when physicists use the principle of virtual work they
almost always assume implicitly that it includes the converse:

The Principle of Virtual Work. The virtual infinitesimal work $\langle v, F \rangle$ equals 0 for
all virtual infinitesimal displacements $v$ of (a non-planar) $b$ if and only if $b$ is in
rigid equilibrium under $F$.

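Proof. We only have to prove the converse part, that if $\langle v, F \rangle = 0$ for all $v$,
then $b$ is in rigid equilibrium under $F$. If we consider the linear function $\Phi$ on
$(\mathbb{R}^3)^K$ defined by

$$\Phi(v_1, \dots, v_K) = \sum_k \langle v_k, F_k \rangle,$$

then our hypothesis says that $\Phi$ vanishes on $M_b$, and thus by our lemma,

$$\Phi \text{ vanishes on } \bigcap_{i,j} \ker \phi_{ij}, \qquad \phi_{ij}(v_1, \dots, v_K) = \langle v_i - v_j,\, b_i - b_j \rangle.$$

It is a well-known simple result about vector spaces (not necessarily even finite
dimensional ones) that in this case there exist constants $\lambda_{ij}$ with

$$\Phi = \sum_{i,j} \lambda_{ij}\, \phi_{ij}.$$

In other words,

$$\sum_k \langle v_k, F_k \rangle = \sum_{i,j} \lambda_{ij} \langle v_i - v_j,\, b_i - b_j \rangle \quad \text{for all } (v_1, \dots, v_K) \in (\mathbb{R}^3)^K.$$

Choosing all $v_k$ to be 0 except for the one vector $v_i$, we thus obtain

$$\begin{aligned}
\langle v_i, F_i \rangle &= \sum_j \lambda_{ij} \langle v_i,\, b_i - b_j \rangle + \sum_j \lambda_{ji} \langle -v_i,\, b_j - b_i \rangle \\
&= \sum_j \lambda_{ij} \langle v_i,\, b_i - b_j \rangle + \sum_j \lambda_{ji} \langle v_i,\, b_i - b_j \rangle \\
&= \sum_j (\lambda_{ij} + \lambda_{ji}) \langle v_i,\, b_i - b_j \rangle \\
&= \Bigl\langle v_i,\, \sum_j (\lambda_{ij} + \lambda_{ji})(b_i - b_j) \Bigr\rangle,
\end{aligned}$$

and since this is true for arbitrary vectors $v_i$ in $\mathbb{R}^3$, we conclude that

$$F_i = \sum_j (\lambda_{ij} + \lambda_{ji})(b_i - b_j).$$

So we can define

$$F_{ij} = -(\lambda_{ij} + \lambda_{ji})(b_i - b_j)$$

to obtain the required forces.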

As an extremely simple example, consider the situation shown below, where
the upward force on $b_2$ balances the two downward forces at points $b_1$ and $b_3$,
which are at different distances $a$ and $b$ from $b_2$, with the magnitudes of these
forces inversely proportional to those distances. Of course, this is merely a
schematic figure, since it is linear, and we really have to assume that there are
other points around, as in our previous examples.

[Figure: the force $-(a + b)F$ at $b_2$, with the forces $bF$ and $aF$ at the two ends.]
It is easy to see that this collection of points is in rigid equilibrium under these
forces:

(1) For an infinitesimal displacement given by a vector $z$ pointing in the
vertical direction, the virtual infinitesimal work is 0 because the upward
force is the negative of the sum of the two downward forces.

(2) For an infinitesimal displacement given by a vector $z$ pointing in the
horizontal direction, the virtual infinitesimal work is 0 because each
individual component is 0.

(3) For an infinitesimal displacement generated by a rotation around $b_2$ (i.e.,
around an axis through $b_2$ perpendicular to the plane of the diagram),
the vectors $v_1$ and $v_3$ will be in opposite vertical directions, with lengths
proportional to the distances $a$ and $b$. Consequently, the virtual
infinitesimal work, involving vectors with length inversely proportional to these
distances, will be 0. (One can check directly that this is just as true for an
infinitesimal displacement generated by a rotation around either $b_1$ or $b_3$,
but that isn't necessary, since the set of virtual infinitesimal displacements
that stay in the plane of the diagram has dimension 3.)

(4) For infinitesimal displacements given by a vector perpendicular to the 
plane of the figure, or by a rotation through axes perpendicular to our 
first rotation, the virtual infinitesimal work also works out to be 0; or we 
can just simplify matters by restricting our attention to the 2-dimensional 
situation to begin with. 
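
The first three checks can be carried out numerically in the plane of the diagram. The sketch below rests on coordinate conventions that are assumptions made for illustration: $b_2$ is placed at the origin, $b_1$ at $(-a, 0)$, $b_3$ at $(b, 0)$, "up" is the positive second coordinate, and the downward forces are taken to have magnitudes $bF$ at $b_1$ and $aF$ at $b_3$. The function names are hypothetical.

```python
def lever(a, b, F):
    """Points b_1, b_2, b_3 on a horizontal line and the forces acting on them."""
    points = [(-a, 0.0), (0.0, 0.0), (b, 0.0)]
    forces = [(0.0, -b * F), (0.0, (a + b) * F), (0.0, -a * F)]
    return points, forces

def planar_displacements(points):
    """A basis for the 3-dimensional space of in-plane virtual displacements."""
    horizontal = [(1.0, 0.0) for _ in points]
    vertical = [(0.0, 1.0) for _ in points]
    # Velocity field of a rotation about b_2, which sits at the origin.
    rotation = [(-y, x) for (x, y) in points]
    return [horizontal, vertical, rotation]

def virtual_works(points, forces):
    """Virtual infinitesimal work <v, F> for each basis displacement."""
    return [
        sum(fx * vx + fy * vy for (fx, fy), (vx, vy) in zip(forces, v))
        for v in planar_displacements(points)
    ]
```

For the lever forces all three works vanish. If instead one applies equal downward forces at unequal distances (say forces of magnitude 1 at distances 2 and 3, balanced by an upward force of magnitude 2), the vertical and horizontal works are still 0 but the rotational work is not, so that collection is not in rigid equilibrium.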

Notice that this provides a fairly good schematic representation of a lever, 
which of course requires not only a rigid body, but also a fulcrum, an immovable 
point. In practice, this "immobility" is provided in a complicated way by the
connections between the fulcrum and the earth, but it seems reasonable simply
to regard this connection as a mechanism that automatically supplies the proper 
upward force to the fulcrum when the downward forces are applied at the ends 
of the lever. 

Naturally, a more realistic picture would use a much larger number of points,
forming a 3-dimensional object. But in any case, our analysis shows, especially
when we think of the lever as bending slightly, that it is the internal forces of
the lever that make the weights balance; in short, all the "extra force" that
one obtains by pushing at a large distance from the fulcrum is supplied by the
lever itself, in its effort to preserve rigidity (together with the force that the earth
supplies on the fulcrum, to keep it from moving downward).

Now we are ready to apply the same ideas in the more general case. Instead
of looking for a condition for equilibrium, we now seek a criterion for a rigid
motion $c = (c_1, \dots, c_K)$ of $b = (b_1, \dots, b_K)$ to be consistent with the forces
$F = (F_1, \dots, F_K)$. We will think of the $F_i$ as functions on $M \times \mathbb{R}$, so the forces
may depend not only on time, but also on the particular rigid motion that the
body has undergone at any particular time.

We now need "internal" forces $F_{ij}(t) = -F_{ji}(t)$ with $F_{ij}(t)$ a multiple of
$c_i(t) - c_j(t)$ so that

$$m_i c_i''(t) = F_i(c(t), t) + \sum_j F_{ij}(t)$$

or

$$m_i c_i''(t) - F_i(c(t), t) = \sum_j F_{ij}(t).$$

The latter equation, which may be regarded as stating that the body is in rigid
equilibrium under the forces $F_i - m_i c_i''$, is often called "d'Alembert's principle"
and regarded as the fundamental law, so that, as the physicists like to say,
"dynamics reduces to statics" (in the words of [Go], Chapter 1.4). But this
really becomes useful only when we apply the principle of virtual work: The
conditions on the $F_{ij}$ imply that

$$\sum_i \langle v_i,\, m_i c_i''(t) - F_i(c(t), t) \rangle = 0$$

for all tangent vectors $v$, and, conversely, the principle of virtual work implies
that if this condition holds, then the requisite $F_{ij}(t)$ exist. This leads us to the
following definition:

d'Alembert's Principle: The rigid motion $c$ is a rigid solution for the 
forces $F$, or, more colloquially, "$c$ is a possible motion of the rigid 
body $b$ under the forces $F$", if for each $t$, 

\[ \sum_i \langle F_i(c(t), t) - m_i c_i''(t),\ v_i \rangle = 0 \]

for all tangent vectors $v$ to $M$ at $c(t)$. 

If we agree to let $mc$ denote $(m_1 c_1, \dots, m_K c_K)$ and similarly for $mc''$, and 
also let $\langle\ ,\ \rangle$ denote the usual inner product on $(\mathbb{R}^3)^K$, then we can write 

\[ \langle F(c(t), t) - mc''(t),\ v \rangle = 0 \quad \text{for all $v$ tangent to $M$ at $c(t)$,} \]

or, if we are willing to tolerate a little ambiguity in our notation, simply 

\[ (**) \qquad \langle F - mc'',\ v \rangle = 0 \quad \text{for all $v$ tangent to $M$ at $c$.} \]

Our condition amounts to a system of second order differential equations for 
vector-valued functions: If we choose a local coordinate system $x^1, \dots, x^6$ on 
the 6-dimensional manifold $M$, then we only have to verify (**) for $v = \partial/\partial x^i$, 
giving us 6 equations for the vector-valued functions $c^i = x^i \circ c$. 

Since $M$ is basically $\mathbb{R}^3 \times SO(3)$, we can restate this much more concretely. 
One 3-dimensional collection of vector fields tangent to $M$ are those of the form 
$v_i = z$ for a constant vector $z$. Condition (**) becomes 

\[ 0 = \sum_i \langle F_i - m_i c_i'',\ z \rangle 
     = \Big\langle \sum_i F_i,\ z \Big\rangle - \Big\langle \sum_i m_i c_i'',\ z \Big\rangle. \]

Since this must hold for all $z$, we must have 

\[ (F_{\text{rigid}}) \qquad F_{\text{total}} = \sum_i F_i = \sum_i m_i c_i'' = M C'', \]

where $C$ is the center of mass, and $M = \sum_i m_i$ is the total mass [recall that $F_i$ 
really stands for $t \mapsto F_i(c(t), t)$]. 

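We also have to consider the vector fields generated by rotations. As we saw 
in our discussion of the $\times$-product, these are of the form $v_i = c_i \times \eta$. Thus, 
condition (**) becomes 

\[ \begin{aligned}
0 &= \sum_i \langle F_i - m_i c_i'',\ c_i \times \eta \rangle \\
  &= \sum_i \langle F_i,\ c_i \times \eta \rangle - \sum_i m_i \langle c_i'',\ c_i \times \eta \rangle \\
  &= -\Big[ \sum_i \langle c_i \times F_i,\ \eta \rangle - \sum_i m_i \langle c_i \times c_i'',\ \eta \rangle \Big].
\end{aligned} \]

Since this must hold for all $\eta$, we must have [with $F_i$ standing for $t \mapsto F_i(c(t), t)$] 

\[ (\tau_{\text{rigid}}) \qquad \tau = \sum_i c_i \times F_i = \sum_i m_i c_i \times c_i''. \]

Condition $(F_{\text{rigid}})$ simply says that the rigid body must move in such a way 
that the momentum law is satisfied, while condition $(\tau_{\text{rigid}})$ simply says that the 
rigid body must move in such a way that the angular momentum law is satisfied. 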

To solve these equations we might begin by writing our rigid motion 
$c = (c_1, \dots, c_K)$ of $b$ in the form 

\[ c_i(t) = B^{-1}(t)(b_i) + w(t) = B^{\mathbf t}(t)(b_i) + w(t) \]

for orthogonal $B(t)$, with $B^{\mathbf t}$ denoting the transpose; we've used $B^{-1}$ instead of $B$ 
to conform to the physicists' convention that $B(t)$ is the rotation that will return 
the body at time $t$ to its original unrotated position. Since $B^{\mathbf t} B = I$, we have 

\[ (B^{\mathbf t})' B = -B^{\mathbf t} B' = -\big( (B')^{\mathbf t} B \big)^{\mathbf t} = -\big( (B^{\mathbf t})' B \big)^{\mathbf t}, \]

so $((B^{\mathbf t})' B)(t)$ is skew-adjoint, and its matrix can be written as 

\[ \begin{pmatrix} 0 & -\omega_3(t) & \omega_2(t) \\ \omega_3(t) & 0 & -\omega_1(t) \\ -\omega_2(t) & \omega_1(t) & 0 \end{pmatrix}. \]

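Setting $\omega(t) = (\omega_1(t), \omega_2(t), \omega_3(t))$, we then have 

\[ \begin{aligned}
v_i(t) = c_i'(t) &= (B^{\mathbf t})'(t)(b_i) + w'(t) \\
 &= \big( (B^{\mathbf t})'(t) B(t) \big) \big( B^{-1}(t)(b_i) \big) + w'(t) \\
 &= \big( (B^{\mathbf t})'(t) B(t) \big) (c_i(t)) + w'(t) \\
 &= c_i(t) \cdot \begin{pmatrix} 0 & -\omega_3(t) & \omega_2(t) \\ \omega_3(t) & 0 & -\omega_1(t) \\ -\omega_2(t) & \omega_1(t) & 0 \end{pmatrix} + w'(t) \\
 &= [\omega(t) \times c_i(t)] + w'(t).
\end{aligned} \]

There is a natural choice for $B(t)$ and $w(t)$ in our description of rigid body 
motion: choose $w(t) = C(t)$, where $C(t)$ is the center of mass at time $t$, so that 
$B(t)$ represents the rotation about the center of mass required to move the body 
back to its initial unrotated position. We then have 

\[ v_i(t) = [\omega(t) \times c_i(t)] + C'(t). \]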

We can now write the angular momentum $A$ of $c$ as 

\[ \begin{aligned}
\sum_i m_i c_i \times v_i &= \sum_i m_i c_i \times [(\omega \times c_i) + C'] \\
 &= \sum_i [m_i c_i \times (\omega \times c_i)] + \Big( \sum_i m_i c_i \Big) \times C' \\
 &= \sum_i [m_i c_i \times (\omega \times c_i)] + (MC \times C'), \qquad M = \sum_i m_i.
\end{aligned} \]

Comparing this with the formula on page 51, we see that the quantity 

\[ \sum_i m_i c_i \times (\omega \times c_i) \]

is the same as the "rotational angular momentum", that is, the angular momentum 
of $c$ around its center of mass. 

For the moment we only want to consider the basic aspects of solving equations 
$(F_{\text{rigid}})$ and $(\tau_{\text{rigid}})$. To simplify matters, we'll ignore the motion of the 
center of mass, and just look for a solution of the form $c_i = B^{\mathbf t}(b_i)$, essentially 
describing how the body rotates about the center of mass. 

Thus, we are looking for $\omega$ so that $(\tau_{\text{rigid}})$ holds when we have 

\[ c_i' = v_i = \omega \times c_i. \]

We have 

\[ c_i'' = (\omega' \times c_i) + (\omega \times c_i') 
        = (\omega' \times c_i) + (\omega \times (\omega \times c_i)), \]

so equation $(\tau_{\text{rigid}})$ becomes 

\[ \tau = \sum_i m_i c_i \times (\omega' \times c_i) + \sum_i m_i c_i \times (\omega \times (\omega \times c_i)). \]

In terms of the linear function $I_c \colon \mathbb{R}^3 \to \mathbb{R}^3$ defined by 

\[ I_c(\eta) = \sum_i m_i c_i \times (\eta \times c_i), \]

we can write this as 

\[ (*) \qquad I_c(\omega') = \tau - \sum_i m_i c_i \times (\omega \times (\omega \times c_i)), \]

where the right side depends on $c \in M$ and $\omega$. The only thing we need to check 
is that we can always solve this for $\omega'$ as a function of $c$ and $\omega$, thereby 
obtaining a system of first order equations for $\omega$, and thus a system of second 
order equations for the elements of $B$. In other words, we need to know that 
the linear transformation $I_c$ is an isomorphism for all $c \in M$. 

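Since we only have to consider $c \in M$ of the form $c_i = P(b_i)$ for some 
orthogonal $P$, we have 

\[ I_c(\eta) = \sum_i m_i P(b_i) \times (\eta \times P(b_i)) 
            = P\Big( \sum_i m_i b_i \times (P^{-1}(\eta) \times b_i) \Big). \]

Thus we only have to check that $I = I_b$ is an isomorphism, where 

\[ I(\phi) = \sum_i m_i b_i \times (\phi \times b_i) \quad \text{for } \phi \in \mathbb{R}^3. \]

Now for any $\psi \in \mathbb{R}^3$ we have 

\[ \begin{aligned}
\langle I(\phi), \psi \rangle &= \Big\langle \sum_i m_i b_i \times (\phi \times b_i),\ \psi \Big\rangle \\
 &= \sum_i m_i \langle \phi \times b_i,\ \psi \times b_i \rangle \\
 &= \sum_i m_i \langle \psi \times b_i,\ \phi \times b_i \rangle \\
 &= \langle I(\psi), \phi \rangle.
\end{aligned} \]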

Thus $I$ is self-adjoint, and consequently has an orthonormal basis of eigenvectors. 
Since 

\[ \langle I(\phi), \phi \rangle = \sum_i m_i |\phi \times b_i|^2, \]

the corresponding eigenvalues are all $\geq 0$, and in fact they are all $> 0$ because 
we are assuming that $b$ is non-planar, and thus at least one $|\phi \times b_i| > 0$. Since $I$ 
always has positive eigenvalues, it is always an isomorphism, so we can indeed 
always solve equation (*) for $\omega'$. 
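
As a numerical sanity check on this argument, here is a minimal sketch in plain 
Python. The masses and points are made-up sample data (an assumption for 
illustration; any non-planar configuration would do), and the helper names 
`cross`, `dot` and `inertia_apply` are hypothetical, introduced only here. The 
sketch evaluates $I(\phi) = \sum_i m_i b_i \times (\phi \times b_i)$ directly from the definition 
and compares the two sides of the self-adjointness and positivity identities. 

```python
from typing import Sequence, Tuple

Vec = Tuple[float, float, float]


def cross(a: Vec, b: Vec) -> Vec:
    """Cross product a x b in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def dot(a: Vec, b: Vec) -> float:
    """Usual inner product on R^3."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def inertia_apply(masses: Sequence[float], points: Sequence[Vec], phi: Vec) -> Vec:
    """Return I(phi) = sum_i m_i * b_i x (phi x b_i)."""
    total = [0.0, 0.0, 0.0]
    for m, b in zip(masses, points):
        term = cross(b, cross(phi, b))
        for k in range(3):
            total[k] += m * term[k]
    return (total[0], total[1], total[2])


# Sample data (assumed for illustration): four non-coplanar points.
masses = [1.0, 2.0, 3.0, 4.0]
points = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0), (1.0, 1.0, 1.0)]
phi = (1.0, -2.0, 0.5)
psi = (0.25, 0.75, -1.5)

# Self-adjointness: <I(phi), psi> should equal <I(psi), phi>.
lhs = dot(inertia_apply(masses, points, phi), psi)
rhs = dot(inertia_apply(masses, points, psi), phi)

# Positivity: <I(phi), phi> should equal sum_i m_i |phi x b_i|^2, which is > 0.
quad = dot(inertia_apply(masses, points, phi), phi)
direct = sum(m * dot(cross(phi, b), cross(phi, b)) for m, b in zip(masses, points))
```

Of course a check on one pair of vectors proves nothing; it only guards against 
sign mistakes in the cross product manipulations. 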

The map $I = I_b$ is called the inertia tensor of $b$. I should say that this is not 
how the inertia tensor is usually introduced (in fact, I don't know of any physics 
text that does introduce it this way), but it's the way it should be introduced! 

The directions of the eigenvectors of the inertia tensor are called the principal 
axes of inertia, and the corresponding eigenvalues are called the principal 
moments of inertia. As our rigid body moves under the rotations $B^{-1}(t)$, the 
inertia tensor for $B^{-1}(t)(b)$ is just the composition $B^{-1}(t) \circ I \circ B(t)$; the principal 
moments of inertia remain the same for all positions of the rigid body under 
the motion, while the principal axes of inertia are transformed by the $B^{-1}(t)$. 

Aside from the values of $F_{\text{total}}$ and $\tau$, the principal moments of inertia are 
the only other data entering into our equations, so, in a sense, the whole motion 
of the rigid body $b$ depends only on them. In particular, for motion under no 
external forces, we obtain exactly the same equations for two rigid bodies of 
arbitrary shape, provided only that they have the same principal moments of 
inertia. 

We've described the inertia tensor rather abstractly, but it is not hard to write 
down the matrix of $I$ with respect to the standard basis $(e_1, e_2, e_3)$, temporarily 
adopting the notation $b_i = (x_i, y_i, z_i)$. 

I should remind you of the identity 

\[ w \times (u \times v) = \langle w, v \rangle u - \langle w, u \rangle v, \]

which we can prove by noting that $w \times (u \times v)$ is perpendicular to $u \times v$, so it 
is a linear combination of $u$ and $v$, and the appropriate coefficients are easy to 
determine using the usual identities for $\times$. 

With this identity at hand, we can write 

\[ I(\omega) = \sum_i m_i \big( |b_i|^2 \omega - \langle b_i, \omega \rangle b_i \big), \]

which gives 

\[ \begin{aligned}
I(e_1) &= \sum_i m_i \big( |b_i|^2 e_1 - \langle b_i, e_1 \rangle b_i \big) \\
       &= \sum_i m_i \big( y_i^2 + z_i^2,\ -x_i y_i,\ -x_i z_i \big),
\end{aligned} \]

with similar results for $e_2$ and $e_3$. Thus, the matrix of $I$ with respect to the 
standard basis $(e_1, e_2, e_3)$ is 

\[ I = \begin{pmatrix}
\sum_i m_i (y_i^2 + z_i^2) & -\sum_i m_i x_i y_i & -\sum_i m_i x_i z_i \\
-\sum_i m_i y_i x_i & \sum_i m_i (x_i^2 + z_i^2) & -\sum_i m_i y_i z_i \\
-\sum_i m_i z_i x_i & -\sum_i m_i z_i y_i & \sum_i m_i (x_i^2 + y_i^2)
\end{pmatrix}. \]

The same result obviously holds for any orthonormal basis $(v_1, v_2, v_3)$ if we set 
$x_i = \langle b_i, v_1 \rangle$, $y_i = \langle b_i, v_2 \rangle$, and $z_i = \langle b_i, v_3 \rangle$. 
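
The matrix above can be compared entry by entry with the cross product 
definition of $I$. The following is only a sketch in plain Python: the sample 
masses and points are assumptions chosen for illustration, and the function 
names `inertia_apply` and `inertia_matrix` are hypothetical. Column $k$ of the 
matrix should agree with $I(e_k)$ computed as $\sum_i m_i b_i \times (e_k \times b_i)$. 

```python
from typing import List, Sequence, Tuple

Vec = Tuple[float, float, float]


def cross(a: Vec, b: Vec) -> Vec:
    """Cross product a x b in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def inertia_apply(masses: Sequence[float], points: Sequence[Vec], phi: Vec) -> Vec:
    """I(phi) = sum_i m_i * b_i x (phi x b_i), straight from the definition."""
    total = [0.0, 0.0, 0.0]
    for m, b in zip(masses, points):
        term = cross(b, cross(phi, b))
        for k in range(3):
            total[k] += m * term[k]
    return (total[0], total[1], total[2])


def inertia_matrix(masses: Sequence[float], points: Sequence[Vec]) -> List[List[float]]:
    """Matrix of I in the standard basis: moments on the diagonal, products off it."""
    mat = [[0.0] * 3 for _ in range(3)]
    for m, (x, y, z) in zip(masses, points):
        mat[0][0] += m * (y * y + z * z)
        mat[1][1] += m * (x * x + z * z)
        mat[2][2] += m * (x * x + y * y)
        mat[0][1] -= m * x * y
        mat[0][2] -= m * x * z
        mat[1][2] -= m * y * z
    # The matrix is symmetric, so copy the entries above the diagonal below it.
    mat[1][0], mat[2][0], mat[2][1] = mat[0][1], mat[0][2], mat[1][2]
    return mat


# Sample data (assumed for illustration).
masses = [1.0, 2.0, 3.0]
points = [(1.0, 2.0, 3.0), (-1.0, 0.0, 2.0), (2.0, -1.0, 1.0)]

matrix = inertia_matrix(masses, points)
basis = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0)]
images = [inertia_apply(masses, points, e) for e in basis]

# Entry (row, col) of the matrix is the row-th component of I(e_col).
max_error = max(abs(matrix[row][col] - images[col][row])
                for row in range(3) for col in range(3))
```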

The diagonal terms of the matrix of $I$ with respect to an orthonormal basis 
are the quantities that were classically called the "moments of inertia" of $b$ 
about the axes; the off-diagonal terms are sometimes called the "products of 
inertia". In other words, the moment of inertia $I_A$ of $b$ about an axis $A$ is 

\[ I_A = \sum_i m_i r_i^2, \]

where $r_i$ is the distance from $b_i$ to $A$. If our orthonormal coordinate system 
happens to point along the principal axes, then these moments of inertia are 
the principal moments of inertia. 

As we might expect, for a rigid body $b$ a special role is played by the 
moment of inertia about the axes that pass through the center of gravity 
$C = (x_C, y_C, z_C)$. More generally, there is a simple relationship between the 
matrix $I$ and the matrix $I'$ of the inertia tensor of $b$ in a parallel coordinate 
system whose origin is $C$. Let 

\[ x_i = \bar x_i + x_C, \qquad y_i = \bar y_i + y_C, \qquad z_i = \bar z_i + z_C, \]

so that $(\bar x_i, \bar y_i, \bar z_i)$ are the coordinates of the rigid body in this system, and let 
$M = \sum_i m_i$. When we write $I$ in terms of the $\bar x_i, \bar y_i, \bar z_i$, any cross term like 
$\sum_i m_i \bar y_i y_C$ vanishes, since 

\[ \sum_i m_i \bar y_i = \sum_i m_i (y_i - y_C) = \sum_i m_i y_i - M y_C, \]

which is 0 by definition of $y_C$. Thus, we obtain simply 

\[ I = I' + M \cdot \begin{pmatrix}
y_C^2 + z_C^2 & -x_C y_C & -x_C z_C \\
-y_C x_C & x_C^2 + z_C^2 & -y_C z_C \\
-z_C x_C & -z_C y_C & x_C^2 + y_C^2
\end{pmatrix} = I' + I_C, \]

where $I_C$ is the matrix of the inertia tensor of the single body $C$ with mass $M$ 
around the origin of our original coordinate system. 

In particular, we have 

The Parallel Axis Theorem (Steiner's Theorem). If an axis $A$ is at distance 
$d$ from the center of mass $C$ of $b$, the moment of inertia of $b$ about $A$ 
is $Md^2$ plus the moment of inertia about the parallel axis through $C$. 
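
The decomposition $I = I' + I_C$, and with it the parallel axis theorem, is easy to 
test numerically. The sketch below is plain Python with made-up sample data 
(an assumption for illustration); `inertia_matrix` is a hypothetical helper that 
builds the matrix of the inertia tensor from the coordinate formula. It compares 
the matrix about the origin with the sum of the matrix about the center of mass 
and the matrix of a single particle of mass $M$ placed at $C$. 

```python
from typing import List, Sequence, Tuple

Vec = Tuple[float, float, float]


def inertia_matrix(masses: Sequence[float], points: Sequence[Vec]) -> List[List[float]]:
    """Matrix of the inertia tensor about the origin, in the standard basis."""
    mat = [[0.0] * 3 for _ in range(3)]
    for m, (x, y, z) in zip(masses, points):
        mat[0][0] += m * (y * y + z * z)
        mat[1][1] += m * (x * x + z * z)
        mat[2][2] += m * (x * x + y * y)
        mat[0][1] -= m * x * y
        mat[0][2] -= m * x * z
        mat[1][2] -= m * y * z
    mat[1][0], mat[2][0], mat[2][1] = mat[0][1], mat[0][2], mat[1][2]
    return mat


# Sample data (assumed for illustration).
masses = [1.0, 2.0, 3.0, 4.0]
points = [(1.0, 0.0, 0.0), (0.0, 1.0, 0.0), (0.0, 0.0, 1.0), (1.0, 1.0, 1.0)]

total_mass = sum(masses)
center = (sum(m * p[0] for m, p in zip(masses, points)) / total_mass,
          sum(m * p[1] for m, p in zip(masses, points)) / total_mass,
          sum(m * p[2] for m, p in zip(masses, points)) / total_mass)

# Coordinates of the body in the parallel system whose origin is C.
shifted = [(p[0] - center[0], p[1] - center[1], p[2] - center[2]) for p in points]

about_origin = inertia_matrix(masses, points)            # I
about_center = inertia_matrix(masses, shifted)           # I'
point_mass = inertia_matrix([total_mass], [center])      # I_C

max_error = max(abs(about_origin[r][c] - about_center[r][c] - point_mass[r][c])
                for r in range(3) for c in range(3))

# Parallel axis theorem for the z-axis: d is the distance from C to that axis.
d_squared = center[0] ** 2 + center[1] ** 2
axis_error = abs(about_origin[2][2] - (about_center[2][2] + total_mass * d_squared))
```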

The parallel axis theorem is actually most useful when we make the obvious 
generalizations to a continuous rigid body $B$ with density $\rho$. First of all, the 
total mass $M$ is given by 

\[ M = \int_B \rho = \int_B \rho(x, y, z)\, dx\, dy\, dz, \]

using $x, y, z$ for the standard coordinate functions on $\mathbb{R}^3$. The center of mass 
is the vector given by 

\[ C = \frac{1}{M} \int_B \rho(x, y, z) \cdot (x, y, z)\, dx\, dy\, dz, \]

i.e., $C$ is the point of $\mathbb{R}^3$ with coordinates 

\[ \frac{1}{M} \int_B x \cdot \rho(x, y, z)\, dx\, dy\, dz, \quad 
   \frac{1}{M} \int_B y \cdot \rho(x, y, z)\, dx\, dy\, dz, \quad 
   \frac{1}{M} \int_B z \cdot \rho(x, y, z)\, dx\, dy\, dz. \]

The inertia tensor of $B$ is the linear transformation whose matrix $I$ with respect 
to the standard basis $(e_1, e_2, e_3)$ is given by 

\[ \begin{pmatrix}
\int_B \rho \cdot (y^2 + z^2)\, dx\, dy\, dz & -\int_B \rho \cdot xy\, dx\, dy\, dz & -\int_B \rho \cdot xz\, dx\, dy\, dz \\
-\int_B \rho \cdot yx\, dx\, dy\, dz & \int_B \rho \cdot (x^2 + z^2)\, dx\, dy\, dz & -\int_B \rho \cdot yz\, dx\, dy\, dz \\
-\int_B \rho \cdot zx\, dx\, dy\, dz & -\int_B \rho \cdot zy\, dx\, dy\, dz & \int_B \rho \cdot (x^2 + y^2)\, dx\, dy\, dz
\end{pmatrix}. \]

A straightforward double integration, left to you, enables us to find the 
moment of inertia of a cylinder about the axis through its center. The parallel axis 
theorem then allows us to compute the moment of inertia around a parallel axis 
that goes through the edge, a result that we will mention in the next lecture. 
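
For the record, here is one way the integration might go. This is only a sketch 
under assumptions not spelled out above: the cylinder is taken to be homogeneous, 
with constant density $\rho$, radius $R$ and height $h$, and its axis is taken to be 
the $z$-axis. In polar coordinates $(r, \theta)$ the integration over $z$ just contributes 
the factor $h$: 

```latex
\begin{aligned}
M &= \rho h \int_0^{2\pi}\!\!\int_0^R r \, dr \, d\theta = \rho \pi R^2 h, \\
I_{\text{center}} &= \int_B \rho \cdot (x^2 + y^2)\, dx\, dy\, dz
   = \rho h \int_0^{2\pi}\!\!\int_0^R r^2 \cdot r \, dr \, d\theta
   = \rho h \cdot 2\pi \cdot \frac{R^4}{4}
   = \tfrac{1}{2} M R^2, \\
I_{\text{edge}} &= I_{\text{center}} + M R^2 = \tfrac{3}{2} M R^2 .
\end{aligned}
```

The last line is the parallel axis theorem with $d = R$, since an axis through the 
edge is at distance $R$ from the axis through the center. 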

Moments of inertia play a crucial role when we consider a rigid body whose 
motion is a rotation about an axis. For a unit vector $u$ pointing along this axis, 
if we decompose each $c_i$ as 

\[ c_i = \bar c_i + \langle c_i, u \rangle u, \]

where $\bar c_i$ is in the plane perpendicular to $u$, then $r_i = |\bar c_i|$ is the distance from $c_i$ 
to the axis. The tangent vector $\bar c_i'$ is in the same plane as $\bar c_i$ and perpendicular 
to it, and if $\theta(t)$ is the angle through which the body has rotated at time $t$, then 
the length of $\bar c_i'$ is $r_i \theta'$. So 

\[ \bar c_i \times \bar c_i' = r_i^2 \theta' \cdot u, \]

and it follows easily that 

\[ \langle c_i \times c_i', u \rangle = \langle \bar c_i \times \bar c_i', u \rangle = r_i^2 \theta', \]

and thus 

\[ \langle c_i \times c_i'', u \rangle = \langle c_i \times c_i', u \rangle' = r_i^2 \theta''. \]

So equation $(\tau_{\text{rigid}})$ gives the $u$ component of $\tau$ as 

\[ (\tau_{\text{axis}}) \qquad \langle \tau, u \rangle = I_A \cdot \theta''. \]

Now we ask whether a rigid body can rotate about an axis if there are no 
external forces. The block shown below can rotate about the three axes of 
symmetry. You might naively expect that if it were provided with the right 
initial push it could also rotate about any other axis that passes through the 
center of mass of the block, as illustrated in the right hand part of the figure, 
but a little thought should be able to convince you otherwise (note that the 
angular momentum vector won't be constant). 

In general, if there are no external forces on our rotating body, then we 
have $\theta'' = 0$ in equation $(\tau_{\text{axis}})$, so that the body rotates with constant angular 
velocity $\theta' = \alpha$. Since we now have 

\[ c_i' = \alpha u \times c_i, \]

we get 

\[ \begin{aligned}
c_i'' &= \alpha u \times c_i' \\
      &= \alpha^2\, u \times (u \times c_i) \\
      &= \alpha^2 [\langle u, c_i \rangle u - c_i],
\end{aligned} \]

so that 

\[ c_i \times c_i'' = -\alpha^2 \langle u, c_i \rangle (u \times c_i). \]

Since there are no external forces, equation $(\tau_{\text{rigid}})$ thus gives 

\[ 0 = \sum_i m_i \langle c_i, u \rangle (u \times c_i). \]

Since 

\[ I_c(u) = \sum_i m_i c_i \times (u \times c_i) 
          = \sum_i m_i |c_i|^2 u - \sum_i m_i \langle c_i, u \rangle c_i, \]

this then shows that 

\[ u \times I_c(u) = 0, \]

so $I_c(u)$ is a multiple of $u$, and $u$ must be an eigenvector of $I_c$. In general, a 
rigid body has just three axes around which it can rotate without external forces, 
and then the angular velocity must be constant. 
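
To see the condition $u \times I_c(u) = 0$ at work, here is a small sketch in plain 
Python. The "block" is modelled, purely as an assumption for illustration, by 
eight unit masses at the corners $(\pm 1, \pm 2, \pm 3)$ of a box centered at the origin; 
the names `inertia_apply` and `residual` are hypothetical. The residual vanishes 
for an axis of symmetry but not for a diagonal axis through the center of mass. 

```python
import math
from itertools import product
from typing import Sequence, Tuple

Vec = Tuple[float, float, float]


def cross(a: Vec, b: Vec) -> Vec:
    """Cross product a x b in R^3."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def inertia_apply(masses: Sequence[float], points: Sequence[Vec], u: Vec) -> Vec:
    """I_c(u) = sum_i m_i * c_i x (u x c_i)."""
    total = [0.0, 0.0, 0.0]
    for m, c in zip(masses, points):
        term = cross(c, cross(u, c))
        for k in range(3):
            total[k] += m * term[k]
    return (total[0], total[1], total[2])


# Assumed model of the block: unit masses at the corners (+-1, +-2, +-3).
points = [(sx * 1.0, sy * 2.0, sz * 3.0) for sx, sy, sz in product((-1, 1), repeat=3)]
masses = [1.0] * len(points)


def residual(u: Vec) -> Vec:
    """u x I_c(u); this must be 0 for a force-free rotation about the axis u."""
    return cross(u, inertia_apply(masses, points, u))


s = 1.0 / math.sqrt(2.0)
res_symmetry = residual((0.0, 0.0, 1.0))   # an axis of symmetry of the box
res_diagonal = residual((s, s, 0.0))       # a diagonal axis in the xy-plane

norm_symmetry = math.sqrt(sum(x * x for x in res_symmetry))
norm_diagonal = math.sqrt(sum(x * x for x in res_diagonal))
```

For this box the inertia matrix is diagonal with entries 104, 80 and 40, so the 
diagonal axis gives the nonzero residual $(0, 0, -12)$. 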
Finally, for use in the next lecture, I want to point out that equation $(\tau_{\text{axis}})$ 
holds even in the more general case where the motion of a rigid body is the 
result of combining rotation about an axis $A$ with a motion of this axis parallel 
to itself. In fact, this just changes the $c_i$ to 

\[ c_i + au + v, \]

for some functions $a$ and $v$, with $v$ always perpendicular to $u$. It is then easy to 
check that 

\[ \langle (c_i + au + v) \times (c_i'' + a''u + v''),\ u \rangle = \langle c_i \times c_i'',\ u \rangle, \]

because each of the other terms in the expansion is 0. 
Though many problems of elementary physics involve a rigid body, this rigid 
body is usually subject to some "constraint", like a block that slides along an 
inclined plane, or a wheel that rolls along it. Simplest of all is the pendulum, a 
rigid body (the pendulum bob) constrained to move in a circular arc by a string 
attached to some fixed point. 

We analyzed a pendulum in the first lecture, but that treatment was necessar- 
ily somewhat incomplete, because a pendulum is actually an abstraction, just 
like a rigid body. If we release a raised pendulum bob (conveniently regarded 
simply as a particle, or point mass), then it will start to fall downwards, rather 
than along the arc of the circle, stretching the string a bit, and then this stretch- 
ing will cause the string to exert an upward force that pulls the string a bit above 
the arc of the circle, and so on. 

The circular arc along which the pendulum bob supposedly moves plays the 
role of the "configuration space" $M$ in our treatment of rigid bodies: it 
represents the set of all positions that the pendulum bob can reach under the 
constraint that it remains at a fixed distance $l$ from the pivot point (we are also 
assuming that the pendulum bob is moving in a plane, to make everything a lot 
easier). 

In addition to the external force $F = gm$ of gravity on our pendulum bob 
of mass $m$, there is an "internal" force $F_1$ on the pendulum bob that is exerted 
along the string, and the law of motion is 

\[ mc'' = F + F_1 \]

or 

\[ mc'' - F = F_1. \]

But the string is always perpendicular to $M$, so this implies that 

\[ (a) \qquad \langle v,\ mc'' - F \rangle = 0 \]

for all $v$ tangent to $M$, just as in d'Alembert's Principle. If $\theta$ is the obvious 
coordinate system on $M$, and we allow the usual abuse of notation of letting $\theta$ 
also be $\theta \circ c$ for our particle $c$, then when we apply this equation to $v = \partial/\partial\theta$ 
we get the same equation 

\[ (P) \qquad \theta'' + \frac{g}{l} \sin\theta = 0 \]

as before. 

In the case of our earlier derivation of (the easy direction of) the principle of 
virtual work for a rigid body, we had to use the equations 

\[ \langle c_i(t) - c_j(t),\ c_i(t) - c_j(t) \rangle = \text{constant} \]

defining a rigid body to derive the equation 

\[ \sum_k \langle v_k, F_k \rangle = 0. \]

But in the case of the pendulum, the analogous equation (a) followed directly 
from the nature of the problem: the internal forces were automatically 
perpendicular to the tangent space of $M$. In fact, "constraint problems" basically 
mean ones in which the internal "constraint" forces have this property. 

We can sum this up as 

d'Alembert's Principle for Constraints: If the constraints on a system 
confine the system to a configuration space $M$, and the constraint forces are 
perpendicular to $M$, then the motions of the system under the external forces $F$ satisfy 

\[ \langle F - mc'',\ v \rangle = 0 \quad \text{for all $v$ tangent to $M$ at $c$.} \]


Another difference between the case of a rigid body and constraint prob- 
lems is that the internal forces for a rigid body are generally not unique, and 
d'Alembert's Principle allows us to solve for the motion without having to find 
the internal forces. On the other hand, "constraint" forces usually are unique, 
and easy to determine once the problem has been solved. 

Let's now consider the "compound pendulum", which is basically a pendulum 
that is a thin plate, of arbitrary shape, oscillating about a fixed point $C_0$. Now 
we have the constraint that our pendulum is a rigid body, together with the 
constraint that the point $C_0$ is fixed. 

Thus we are assuming that the constraint forces keep the point $C_0$ at 
distance 0 from some point $P$. This is a bit weird, of course, since this constraint 
force supposedly acts along the line between $C_0$ and $P$, which doesn't tell us 
anything. Nevertheless, even though we can't specify the direction of this constraint 
force $C$, we will still have $\langle C, v_0 \rangle = 0$ for all virtual infinitesimal displacements 
$v = (v_0, \dots, v_K)$ of our pendulum under this constraint, for the simple reason 
that $v_0 = 0$, since the constraint keeps $C_0$ fixed. 

For this problem our configuration space is again a circle, although now we 
don't think of it as a circle of a particular radius, but simply as the collection of 
angles $\theta$ through which our pendulum can rotate. Restricting ourselves to this 
configuration space takes care of the constraint that $C_0$ stays fixed, and we've 
already analyzed all the other constraints that make the pendulum a rigid body; 
and we just have to apply our equation $(\tau_{\text{axis}})$ from Lecture 5. 

Since the force on particle $c_i$ is $g m_i u$, where $u$ is the unit downward vector 
field, we have 

\[ \tau = \sum_i c_i \times g m_i u = gM \cdot C \times u, \]

where $C$ is the center of mass of the pendulum. This means that $\tau$ points in 
the direction of the axis $A$, and has magnitude 

\[ gMl \sin\theta, \]

where $\theta$ is the angle of $C$ with the vertical, and $l$ its distance from the pivot. 
So equation $(\tau_{\text{axis}})$ becomes 

\[ \theta'' + \frac{gMl}{I_A} \sin\theta = 0. \]
Ia 

Comparing to the formula (P), we see that our pendulum acts precisely like 
a single bob pendulum whose distance from the pivot is 

\[ \frac{I_A}{Ml}. \]

If $I_C$ is the moment of inertia about the center of gravity $C$, then by the parallel 
axis theorem we have 

\[ \frac{I_A}{Ml} = \frac{I_C + Ml^2}{Ml} = l + \frac{I_C/M}{l}. \]

Introducing the radius of gyration $k$ by 

\[ I_C = Mk^2, \]

we find that our pendulum acts precisely like a single bob pendulum whose 
distance from the pivot is 

\[ l + \frac{k^2}{l}. \]

When our rigid body pendulum consists only of the pivot and a single particle 
at $C$, we have $k = 0$, but in any other case the pendulum will have a longer 
period. 
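
The equivalent length is easy to compute in practice. The sketch below is plain 
Python; the function names are hypothetical, and the example body (a uniform 
thin rod of length $L$ pivoted at one end, for which $I_C = ML^2/12$ and $l = L/2$) 
is an assumption chosen for illustration, as is the value of $g$. The period 
formula $2\pi\sqrt{\text{length}/g}$ is the usual small-oscillation approximation for a single 
bob pendulum, which is not derived in these notes. 

```python
import math


def equivalent_length(moment_about_center: float, mass: float, pivot_distance: float) -> float:
    """Length l + k^2/l of the single bob pendulum with the same equation of motion.

    moment_about_center is I_C, and k^2 = I_C / M is the squared radius of gyration.
    """
    k_squared = moment_about_center / mass
    return pivot_distance + k_squared / pivot_distance


def small_angle_period(length: float, g: float = 9.81) -> float:
    """Small-oscillation period of a single bob pendulum of the given length."""
    return 2.0 * math.pi * math.sqrt(length / g)


# Assumed example: uniform thin rod of length 1.2 and mass 0.5, pivoted at one end.
rod_length = 1.2
rod_mass = 0.5
i_center = rod_mass * rod_length ** 2 / 12.0   # I_C of a uniform thin rod
l = rod_length / 2.0                           # distance from pivot to center of mass

length_rod = equivalent_length(i_center, rod_mass, l)

# The same number computed as I_A / (M l), with I_A from the parallel axis theorem.
i_pivot = i_center + rod_mass * l ** 2
length_check = i_pivot / (rod_mass * l)

# A single particle at C has I_C = 0, so its equivalent length is just l.
length_particle = equivalent_length(0.0, rod_mass, l)

period_rod = small_angle_period(length_rod)
period_particle = small_angle_period(length_particle)
```

For the rod the equivalent length comes out as $2L/3$, longer than $l = L/2$, so its 
period is longer than that of a point mass at the same distance from the pivot. 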

A good illustration of the method of configuration spaces is provided by a 
problem that becomes quite complicated with standard elementary analysis. 
Elementary physics courses almost always analyze a block sliding down an 
inclined plane, though it may not always be emphasized that this plane is 
presumed to be immovable, rather than an object that can itself slide horizontally 
along the floor. 

The force $F$ of gravity on the block is decomposed into a force $F_2$ parallel to 
the inclined plane, and a force $F_1$ perpendicular to the inclined plane, where 
$F_2$ supposedly doesn't act on the inclined plane, because we are assuming that 
the block slides without friction. We reason, from the third law, that since $F_1$ 
is the force of the block on the inclined plane, the inclined plane must exert 
the force $-F_1$ on the block. Here is where we are using the hypothesis that 
the inclined plane is stationary: $F_1$ determines the acceleration of the block in 
the direction perpendicular to the inclined plane, but that must be 0 (since the 
inclined plane is not moving and the block slides along it), so the inclined plane 
must be exerting a force of $-F_1$ on the block. 

We thus find that the force $F_2$ parallel to the inclined plane has magnitude 
$mg \sin\alpha$, so if $c(t)$ is the distance that the block has traveled along the inclined 
plane after time $t$, then $c'' = g \sin\alpha$, and our block slides with a uniform 
acceleration that is $\sin\alpha$ times its free fall acceleration. 

But things are quite a bit more complicated if we allow the inclined plane 
to be a wedge that slides along the horizontal plane, without friction. The 
force $F_1$ that the block exerts on the wedge can no longer be obtained simply 
by resolving the downward force of gravity $F$ into forces perpendicular and 
parallel to the wedge, because our identification of $F_1$ with the perpendicular 
component depended on the wedge being fixed. 

Choosing the unit vector $e_1 = (1,0)$ parallel to the floor and the unit vector $u$ 
parallel to the slope of the wedge, we let $Ae_1$ be the acceleration of the wedge, 
of mass $M$, along the horizontal plane, while $au$ is the acceleration of the block, 
of mass $m$, along the wedge, so that $au + Ae_1$ is the acceleration of the block in 
our inertial system. Note that in our picture, we actually have $A < 0$, so that 
the arrow $Ae_1$ points in the opposite direction, since the force $F_1$ causes the 
wedge to slide to the left. 
Breaking up the equation 

\[ (1) \qquad F_1 + F = m(au + Ae_1) \]

for the motion of the block into the components that are parallel and 
perpendicular to the slope of the wedge gives 

\[ \begin{aligned}
(1a) \qquad & mg \sin\alpha = ma + mA \cos\alpha \\
(1b) \qquad & |F_1| - mg \cos\alpha = mA \sin\alpha.
\end{aligned} \]

In equation (1), $F_1$ is the force that the wedge exerts on the block, so the force 
on the wedge, of mass $M$, is the reaction $-F_1$ plus the gravitational force 
downward, plus whatever upward force the horizontal plane must exert to keep the 
wedge from moving downwards. So $A$ is determined by the horizontal component 
of this reaction: 

\[ (2) \qquad -|F_1| \sin\alpha = MA. \]

Solving (1b) and (2) for $A$ gives 

\[ A = -\frac{mg \sin\alpha \cos\alpha}{M + m \sin^2\alpha}, \]

and then (1a) gives 

\[ a = g \sin\alpha - A \cos\alpha. \]

To analyze this problem using configuration spaces, we regard our system 
as consisting of two "particles", the block $c_m$ and the wedge $c_M$. Since the 
wedge $c_M$ always stays on the horizontal axis, we'll simply consider our problem 
as occurring in $(\mathbb{R}^2) \times \mathbb{R}$, with $((a, b), x)$ representing the particle $c_m$ at the point 
$(a, b)$, and the particle $c_M$ at $x$. 

Our configuration space $M$ consists of all $((a, b), x)$ which represent the 
block $c_m$ resting on the wedge $c_M$ [i.e., for which we have $b = (x - a) \cos\alpha$]. 
A convenient coordinate system on $M$ is provided by the coordinate $x \in \mathbb{R}$ 
giving the position of $c_M$, together with the distance $s$ of $c_m$ from the top of 
the wedge. 

To determine $\partial/\partial s$, we keep $x$ fixed and vary $s$, obtaining a curve in $M$ whose 
$\mathbb{R}^2$ component moves down the slope of the wedge, while its $\mathbb{R}$ component is 
fixed, so for the unit vector $u$ in $\mathbb{R}^2$ parallel to the slope of the wedge, we have 

\[ \frac{\partial}{\partial s} = (u, 0). \]

On the other hand, if we keep $s$ fixed and vary $x$, then we obtain a curve in $M$ 
whose $\mathbb{R}^2$ component moves parallel to the first axis along with its $\mathbb{R}$ component, 
so for $e_1 = (1, 0)$ we have 

\[ \frac{\partial}{\partial x} = (e_1, 1). \]

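Now if $s(t), x(t)$ are the functions describing the motion of $c_m, c_M$, we have 

\[ \begin{aligned}
c_m(t) &= s(t)u + x(t)e_1 \in \mathbb{R}^2 \\
c_M(t) &= x(t) \in \mathbb{R},
\end{aligned} \]

so 

\[ \begin{aligned}
c_m'' &= s''u + x''e_1 \in \mathbb{R}^2 \\
c_M'' &= x'' \in \mathbb{R}.
\end{aligned} \]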

The external forces $F_m$ on $c_m$ and $F_M$ on $c_M$ are given by 

\[ F_m = -mg e_2, \qquad F_M = 0, \]

so our condition for a solution is that 

\[ \langle -mg e_2 - m c_m'',\ v_1 \rangle + (0 - M c_M'') \cdot v_2 = 0 \]

for all $v = (v_1, v_2)$ tangent to $M$, where $\langle\ ,\ \rangle$ is the usual inner product in the 
first factor $\mathbb{R}^2$, while the inner product in the second factor $\mathbb{R}$ is just ordinary 
multiplication. Choosing 

\[ v = \frac{\partial}{\partial s} = (u, 0) \quad \text{and then} \quad v = \frac{\partial}{\partial x} = (e_1, 1) \]

gives the two equations 

\[ \begin{aligned}
0 &= \langle -mg e_2 - ms''u - mx''e_1,\ u \rangle - Mx'' \cdot 0 \\
0 &= \langle -mg e_2 - ms''u - mx''e_1,\ e_1 \rangle - Mx'' \cdot 1,
\end{aligned} \]

which amount to the equations 

\[ \begin{aligned}
0 &= mg \sin\alpha - ms'' - mx'' \cos\alpha \\
0 &= -ms'' \cos\alpha - mx'' - Mx''.
\end{aligned} \]

Solving for $x''$ and $s''$ gives us the same results that we obtained previously, 
when they were called $A$ and $a$, respectively. The first of the above equations 
is precisely (1a), while the second is a combination of (1a), (1b), and (2). 
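
The claim that the two approaches agree can be checked numerically. The sketch 
below is plain Python with hypothetical function names and made-up sample 
values for $m$, $M$, $\alpha$ and $g$. It solves the two configuration space equations 
for $s''$ and $x''$ by Cramer's rule, and compares them with the elementary result, 
under the assumption that the wedge equation is read as $-|F_1| \sin\alpha = MA$ (the 
force on the wedge being opposite to the force on the block). 

```python
import math
from typing import Tuple


def configuration_space_solution(m: float, M: float, alpha: float, g: float = 9.81) -> Tuple[float, float]:
    """Solve  m s'' + m cos(alpha) x'' = m g sin(alpha)
              m cos(alpha) s'' + (m + M) x'' = 0
    for (s'', x'') by Cramer's rule."""
    a11, a12, b1 = m, m * math.cos(alpha), m * g * math.sin(alpha)
    a21, a22, b2 = m * math.cos(alpha), m + M, 0.0
    det = a11 * a22 - a12 * a21
    s_dd = (b1 * a22 - a12 * b2) / det
    x_dd = (a11 * b2 - a21 * b1) / det
    return s_dd, x_dd


def elementary_solution(m: float, M: float, alpha: float, g: float = 9.81) -> Tuple[float, float]:
    """Eliminate |F_1| from (1b) and -|F_1| sin(alpha) = M A, then use (1a)."""
    A = -m * g * math.sin(alpha) * math.cos(alpha) / (M + m * math.sin(alpha) ** 2)
    a = g * math.sin(alpha) - A * math.cos(alpha)
    return a, A


# Sample values (assumed for illustration).
m, M, alpha = 2.0, 5.0, 0.5

s_dd, x_dd = configuration_space_solution(m, M, alpha)
a, A = elementary_solution(m, M, alpha)

# With a very heavy wedge we should recover the fixed inclined plane, s'' = g sin(alpha).
s_dd_heavy, x_dd_heavy = configuration_space_solution(m, 1.0e12, alpha)
fixed_plane = 9.81 * math.sin(alpha)
```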

Finally, let us consider the problem of rolling wheels. Physics books that 
discuss rolling in any reasonably detailed way point out that rolling depends on 
a truly paradoxical fact: a wheel only rolls because of frictional forces, ones that 
"oppose sliding": a wheel not only displays the characteristics of our abstract 
rigid body, but it also has the strange feature that it is affected by a frictional 
force that is exerted only at the (always changing) contact point of the wheel on 
the inclined plane. To make the picture even more confusing, we can somehow 
ignore this frictional force because, the physics books note, the path followed 
by any point on the circumference of the wheel has velocity 0 at the moment it 
hits the plane! 

It is indeed well known that a cycloid, the path followed by a point on the 
circumference of a wheel, has velocity 0 at the point of contact. Physics books 
seem to regard this as intuitively clear, but for those of us not endowed with the 
requisite physical intuition, here is a proof, for the general case of one surface 
rolling on another. 

Proposition. Consider two surfaces $M$ and $\bar M$ in $\mathbb{R}^3$ that are tangent at a point 
$p$. Let $c$ be a curve in $M$, and $\bar c$ a curve in $\bar M$ such that $c(0) = p = \bar c(0)$, and 
such that $\bar c'(0) \neq 0$ is a multiple of $c'(0)$. For each $t$ let $A(t)$ be the rigid motion 
for which 

(a) $A(t)(\bar c(t)) = c(t)$, 

(b) $A(t)(\bar M)$ is tangent to $M$ at $c(t)$, 

(c) $A(t)(\bar c'(t))$ points in the same direction as $c'(t)$, so that 

\[ A(t)(\bar c'(t)) = \alpha(t) \cdot c'(t) \quad \text{for some function } \alpha. \]

Also, for each point $\bar c(\tau)$ on the curve $\bar c$, let $\gamma_\tau$ be the curve that this point 
follows under these rigid motions, 

\[ \gamma_\tau(t) = A(t)(\bar c(\tau)). \]

Then for all $t$ we have 

\[ \alpha(t) = 1 \iff \gamma_t'(t) = 0. \]

Consequently, $\alpha(t) = 1$ for all $t$, so that the lengths of $c$ and $\bar c$ are the same on 
any time interval $[t_0, t_1]$, if and only if all $\gamma_t'(t) = 0$, so that $\gamma_t$ has velocity 0 at 
the time that it hits $M$. 

Proof. Write $A(t)$ in the form 

\[ A(t)(x) = B(t)(x) + w(t), \qquad x \in \mathbb{R}^3, \]

for orthogonal $B(t)$. Setting $x = \bar c(t)$ and using $A(t)(\bar c(t)) = c(t)$, we see that 
$w(t) = c(t) - B(t)(\bar c(t))$, so we can write 

\[ A(t)(x) = B(t)(x) + [c(t) - B(t)(\bar c(t))]. \]

The definition $\gamma_0(t) = A(t)(\bar c(0)) = A(t)(p)$ gives 

\[ \gamma_0(t) = B(t)(p) + c(t) - B(t)(\bar c(t)). \]

Since $B(0)$ is the identity, differentiation gives 

\[ \begin{aligned}
\gamma_0'(0) &= B'(0)(p) + c'(0) - B'(0)(p) - \bar c'(0) \\
 &= c'(0) - \bar c'(0) \\
 &= c'(0) - \alpha(0) \cdot c'(0).
\end{aligned} \]

So $\alpha(0) = 1$ if and only if $\gamma_0'(0) = 0$. 

Our hypotheses on $A(t)$ then allow us to use this same argument at any point 
$\bar c(\tau)$ by considering the reparameterization $t \mapsto t + \tau$. 
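
For the special case of a wheel rolling along a straight line, the statement can be 
checked directly. The sketch below is plain Python and uses the standard 
parametrization of the cycloid, $(R(t - \sin t),\ R(1 - \cos t))$, where $t$ is the angle 
through which the wheel has turned; this parametrization and the function names 
are assumptions brought in for illustration, not part of the proof above. The 
marked point touches the line at $t = 0$ and $t = 2\pi$, and is at the top at $t = \pi$. 

```python
import math
from typing import Tuple


def cycloid(t: float, R: float = 1.0) -> Tuple[float, float]:
    """Position of a point on the rim of a wheel of radius R rolling on the x-axis."""
    return (R * (t - math.sin(t)), R * (1.0 - math.cos(t)))


def cycloid_velocity(t: float, R: float = 1.0) -> Tuple[float, float]:
    """Derivative of cycloid() with respect to t, computed by hand."""
    return (R * (1.0 - math.cos(t)), R * math.sin(t))


def speed(t: float, R: float = 1.0) -> float:
    vx, vy = cycloid_velocity(t, R)
    return math.hypot(vx, vy)


def central_difference(t: float, h: float = 1.0e-5) -> Tuple[float, float]:
    """Numerical derivative of cycloid(), as an independent check on cycloid_velocity()."""
    x1, y1 = cycloid(t + h)
    x0, y0 = cycloid(t - h)
    return ((x1 - x0) / (2.0 * h), (y1 - y0) / (2.0 * h))


speed_at_contact = speed(0.0)                # the point is on the line
speed_at_next_contact = speed(2.0 * math.pi)
speed_at_top = speed(math.pi)                # the point is at the top of the wheel

# Compare the hand-computed velocity with the numerical one at a generic time.
vx, vy = cycloid_velocity(1.3)
nx, ny = central_difference(1.3)
difference = max(abs(vx - nx), abs(vy - ny))
```

The speed at the top is $2R$ per unit angle, twice that of the center, while at the 
moments of contact it vanishes. 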

For the case of a wheel rolling down an inclined plane, we can provide a 
picture that both reinforces this geometric information and also allows us to see 
"what is going on" physically. We regard our circular wheel as a polygon with 
a very large number of sides, and suppose that initially, in position 1, it is lying 
on the inclined plane along the segment $AB$. Now, instead of sliding down the 
plane, it rotates about the point $B$, reaching position 2 when vertex $C$ hits the 
inclined plane, at $C'$. Then it rotates around $C'$, and so forth. 

Let's return to the special case of a wheel rolling down an inclined plane, 
once again assumed to be immovable. Our "wheel" is really supposed to be a 
cylinder, so that it is forced to roll along a straight line, but the 2-dimensional 
cross-section picture provides all the interesting information. The standard 
elementary treatment of this problem is the following: 

We consider a wheel of radius $R$ and mass $M$ and uniform density, and let 
$\phi(t)$ be the angle through which it has rotated after time $t$. For a unit vector $u$ 
pointing down along the inclined plane, we will let $a \cdot u$ be the acceleration of 
the center of mass, and let $-f \cdot u$ be the frictional force along the inclined plane 
at the contact point of the wheel and the plane. The total force on the wheel is 
the sum of the downward force of gravity, a constraining force $F_1$ perpendicular 
to the plane, which keeps the wheel from moving perpendicularly to the plane, 
and the frictional force $-f \cdot u$. 

For the acceleration $a \cdot u$ of the center of mass we have 

\[ (1) \qquad Ma = Mg \sin\alpha - f, \]

since $Mg \sin\alpha$ is the magnitude of the component of the gravitational force 
along the inclined plane, while the constraining force $F_1$ is perpendicular to $u$. 

For the rotational motion about the center of mass we can apply equation 
$(\tau_{\text{axis}})$ to the axis through the center of mass that is perpendicular to the plane 
of the drawing to get 

\[ (2) \qquad I\phi'' = Rf, \]

where $I$ is the moment of inertia of the wheel about its center of mass. 

Finally, the fact that our wheel is rolling tells us that the distance traveled by 
the center of mass at time $t$ is equal to $R \cdot \phi(t)$, which means that 

\[ (3) \qquad a = R \cdot \phi''. \]
Solving (1)-(3) gives 

\[ a = g \sin\alpha \cdot \frac{1}{1 + \dfrac{I}{R^2 M}}, \]

and substituting into (1) gives 

\[ f = Mg \sin\alpha \cdot \frac{I}{R^2 M + I}. \]

This indicates the amount of frictional force that the inclined plane must be 
able to produce in order to prevent sliding. As a general rule, the frictional 
force that a body produces on an inclined plane is proportional to the normal 
component of the downward gravitational force, i.e., it equals $\mu \cdot Mg \cos\alpha$ for 
a constant $\mu$, the "coefficient of friction". So to prevent sliding at the angle $\alpha$ 
we need to have 

\[ \mu \cdot Mg \cos\alpha = Mg \sin\alpha \cdot \frac{I}{R^2 M + I} \]

or 

\[ \mu = \tan\alpha \cdot \frac{I}{R^2 M + I}. \]

Thus we need a "perfectly rough" surface, with "$\mu = \infty$", if we want to prevent 
sliding at any angle. 

For a homogeneous disc of radius $R$ and mass $M$ we easily compute that 
$I = \tfrac{1}{2} MR^2$, so we have 

\[ a = \tfrac{2}{3}\, g \sin\alpha. \]

Thus the wheel rolls down the inclined plane with only 2/3 of the acceleration with 
which a block slides down a frictionless inclined plane. 
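
As a check on the algebra, the following sketch in plain Python substitutes the 
solution back into equations (1), (2) and (3) and evaluates the homogeneous disc. 
The function name and the sample values of $M$, $R$, $\alpha$ and $g$ are assumptions made 
for illustration only. 

```python
import math
from typing import Tuple


def rolling_solution(M: float, R: float, I: float, alpha: float, g: float = 9.81) -> Tuple[float, float, float]:
    """Return (a, f, phi'') for a wheel rolling without sliding down an incline of angle alpha."""
    a = g * math.sin(alpha) / (1.0 + I / (R * R * M))     # acceleration of the center of mass
    f = M * g * math.sin(alpha) * I / (R * R * M + I)     # magnitude of the frictional force
    phi_dd = a / R                                        # rolling condition (3)
    return a, f, phi_dd


# Sample values (assumed for illustration): a homogeneous disc, I = M R^2 / 2.
M, R, alpha, g = 3.0, 0.4, 0.3, 9.81
I = 0.5 * M * R * R

a, f, phi_dd = rolling_solution(M, R, I, alpha, g)

# Residuals of equations (1) and (2); both should vanish up to rounding.
residual_1 = M * a - (M * g * math.sin(alpha) - f)
residual_2 = I * phi_dd - R * f

# For the disc the acceleration is 2/3 of g sin(alpha) ...
ratio = a / (g * math.sin(alpha))

# ... and the coefficient of friction needed at this angle is tan(alpha) / 3.
mu_needed = math.tan(alpha) * I / (R * R * M + I)
```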

Even though a rolling wheel involves friction, it is still a natural candidate 
for treatment by d'Alembert's principle, for the same reason that the standard 
elementary treatment ignores the effect of friction: If we consider our wheel as 
a rigid body made up of a large collection of particles, and let $v$ be any virtual 
infinitesimal displacement of the wheel, then the inner product $\langle f, v_p \rangle$ of the 
frictional force $f$ and the velocity $v_p$ at the point of contact $p$ is always 0 since, 
as our Proposition shows, $v_p = 0$. 

Thus, the situation is quite like that of the compound pendulum. The only 
real difficulty is that we have a sort of hybrid between our initial pendulum bob 
problem, where we considered a single particle acted upon by a constraint force, 
and the compound pendulum, where almost all our constraints had already 
been considered in the analysis of rigid body motion. We really need to think 
of our wheel as representing two different "particles" $s$ and $\phi$ in $\mathbb{R}$, 

    $s$ = position of the center of mass 
    $\phi$ = angle through which wheel has turned, 

having the respective masses 

    $M$ = the total mass of the wheel 
    $I$ = the moment of inertia of the wheel. 

We thus have a problem in $\mathbb{R}^2$ that is reduced to a problem on a 1-dimensional 
submanifold $M \subset \mathbb{R}^2$ by the rolling condition 

\[ \phi(t) = s(t)/R. \]

On $M$ we have the obvious single coordinate $s$, the distance along the inclined 
plane, and the corresponding tangent vector to $M$ represents the pair 

\[ (1, 1/R). \]

We are looking for a function s(t) such that 

(s(t),<p(t)) = (s(t),s(t)/R) 
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satisfies 

= ( (-F, - Ms", -/</>"), (U/*)) 
= ( (-F, - Ms", - Is"/R), (1, 1/7?) ), 

where ¥ s is the component of the force on the center of mass that is parallel to 
the inclined plane. Thus we get 

= Mg sin a - Ms" - (Is"/R) ■ 1 /R 
= Mg sin a - Ms" - Is"/R 2 , 

which gives the same result 

• 1 
s = g sin a — 

1 H — 

R 2 M 

that we obtained previously by combining three equations. 
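
The formula for $s''$ is easy to check numerically. The following is only a minimal sketch: the function names and the sample values of $g$, $\alpha$, $M$, and $R$ are illustrative assumptions, not part of the lectures. It evaluates $s''$ for a homogeneous disc and compares it with the acceleration $g\sin\alpha$ of a block sliding without friction.

```python
import math

def rolling_acceleration(g, alpha, M, R, I):
    """s'' = g sin(alpha) / (1 + I / (R^2 M)) for a wheel rolling without slipping."""
    return g * math.sin(alpha) / (1.0 + I / (R**2 * M))

def disc_inertia(M, R):
    """Moment of inertia of a homogeneous disc about its axle, I = M R^2 / 2."""
    return 0.5 * M * R**2

# Illustrative values only (assumed, not taken from the text).
g, alpha, M, R = 9.81, math.radians(30.0), 2.0, 0.5

a_disc = rolling_acceleration(g, alpha, M, R, disc_inertia(M, R))
a_block = g * math.sin(alpha)      # frictionless sliding block
ratio = a_disc / a_block           # should be 2/3, independent of M and R
print(ratio)
```

The ratio does not depend on $M$ or $R$, since for the disc $I/(R^2M) = 1/2$ whatever the size of the wheel.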
HOLONOMIC AND
NON-HOLONOMIC CONSTRAINTS



In the last lecture we examined "constraint" problems where the constraints
restricted our solutions to lie in a "configuration space" $M$ that was a
submanifold of the larger space for which the problem was originally posed. Our
method of solution basically used the obvious principle that if you are looking
for the solutions of a differential equation on a manifold, and you know that
the solution lies on a submanifold $M$ of it, then you might as well just consider
what the equation says on $M$, thereby obtaining an equation in fewer variables.

Physicists call such constraints "holonomic". They also mention various sorts 
of other constraints, like the constraint that particles remain within a given 
box, which are expressed by inequalities or more complicated conditions, and 
obviously require special considerations in each individual case. But there is
one other very important sort of "non-holonomic" constraint that allows a
systematic treatment.

The standard example of this kind of constraint is provided by an upright
disc rolling on a plane. The possible positions of the disc are determined by the
coordinates $(x, y)$ of the point at which the disc rests on the plane, the angle $\phi$
that a fixed point on the disc makes with the vertical, and the angle $\theta$ that the
plane of the disc makes with the $x$-axis.

This example is rather idealized. To begin with, in order for the disc to
remain upright, we might imagine that it has a companion disc attached to it
by an axle. We will want to assume that the axle and the companion disc both
have negligible weight, and it is also important that the two discs be able to
rotate independently about this axle, so that, for example, the disc can revolve
around a circle, with its companion "shadow" disc revolving around a circle of
a different radius. In addition, although our disc has to have some thickness,
we want to imagine it to be so small that it actually can roll along a circle (or,
indeed, along any path) rather than being constrained to roll along a straight
line.

[Figure: the upright disc on the plane, with the plane of the disc indicated.]

In the simplest case, where there are no external forces, it is easy to guess from 
the symmetry of the situation that the disc of mass m and angular momentum L 
will roll with constant speed along a circle of radius m/|L|, or along a straight 
line when L = 0. But that doesn't suggest a general method for solving the 
problem where there are external forces, for example if we tilt the plane, so that 
now the force of gravity is only partially offset by the constraining perpendicular 
force of the plane. 

Unfortunately, we are stymied when we try to use our method of configuration
spaces to reduce the problem to one in fewer variables. Starting with our disc at
a point $(x_0, y_0)$, we can roll it, as in (a), to a nearby point $(x_1, y_1)$ along paths of
the same length that all start in the same direction at $(x_0, y_0)$ but reach $(x_1, y_1)$
at different angles. This means that we obtain a whole interval of possible $\theta$
values at some particular value of $x_1$, $y_1$, and $\phi_1$. Moreover, we can also roll
it, as in (b), along paths of different lengths that all have the same direction
at both $(x_0, y_0)$ and $(x_1, y_1)$, thereby obtaining a whole interval of possible $\phi$
values at a particular value of $x_1$, $y_1$, and $\theta_1$. Thus, the proper configuration
space for this problem is a whole neighborhood of $\mathbb{R}^2 \times S^1 \times S^1$, rather than
a lower-dimensional submanifold. (The case of a sphere, or other convex body,
rolling on a plane, presents exactly the same problem, with fewer idealizations
necessary, but it presents a problem even harder to analyze than the disc.)




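[Figure: (a) paths of the same length from $(x_0, y_0)$ to $(x_1, y_1)$ arriving at different angles; (b) paths of different lengths with the same direction at both endpoints.]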
This phenomenon is a reflection of a simple fact about the relations between
the coordinates of our disc moving in the space with coordinate functions
$(x, y, \phi, \theta)$. Letting $x(t)$, $y(t)$, $\phi(t)$, $\theta(t)$ denote the components of the
coordinates of the disc, the velocity of the center of mass is $R\phi'$, where $R$ is the radius
of the disc, and consequently we have

$$\begin{aligned}
x' &= R\phi'\cos\theta \\
y' &= R\phi'\sin\theta.
\end{aligned} \tag{1}$$

This means that the tangent vectors of the curve satisfy

$$\begin{aligned}
0 &= dx - R\cos\theta\, d\phi \\
0 &= dy - R\sin\theta\, d\phi.
\end{aligned} \tag{1'}$$

In particular, they therefore satisfy the condition

$$dy - \tan\theta\, dx = 0.$$

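This determines a 3-dimensional subspace of all tangent vectors at each point,
but this 3-dimensional distribution isn't integrable, as we can easily see from the
standard integrability conditions. To apply the differential form version of the
Frobenius integrability theorem, for example, we simply note that the 2-form

$$d(dy - \tan\theta\, dx) = -\sec^2\theta\, d\theta \wedge dx$$

isn't in the ideal generated by $dy - \tan\theta\, dx$. Equivalently we can note that the
distribution is spanned by the vectors

$$X_1 = \frac{\partial}{\partial x} + \tan\theta\,\frac{\partial}{\partial y}, \qquad X_2 = \frac{\partial}{\partial\theta}, \qquad X_3 = \frac{\partial}{\partial\phi},$$

but the bracket

$$\left[\frac{\partial}{\partial x} + \tan\theta\,\frac{\partial}{\partial y},\ \frac{\partial}{\partial\theta}\right] = -\sec^2\theta\,\frac{\partial}{\partial y}$$

obviously cannot be written as a linear combination of the three vectors $X_1$, $X_2$,
and $X_3$.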
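
The failure of integrability can also be seen numerically. The sketch below is only an illustration under stated assumptions: the coordinate order $(x, y, \phi, \theta)$, the function names, and the sample point are choices made here, and the derivatives are approximated by central differences. It computes the bracket of $\partial/\partial x + \tan\theta\,\partial/\partial y$ with $\partial/\partial\theta$ and applies the 1-form $dy - \tan\theta\,dx$ to the result.

```python
import math

# Points and vectors use the (assumed) coordinate order (x, y, phi, theta).

def field_x(p):
    """The vector field d/dx + tan(theta) d/dy."""
    return [1.0, math.tan(p[3]), 0.0, 0.0]

def field_theta(p):
    """The vector field d/dtheta."""
    return [0.0, 0.0, 0.0, 1.0]

def directional(V, p, u, h=1e-6):
    """Central-difference derivative of the components of V at p in direction u."""
    plus = V([pi + h * ui for pi, ui in zip(p, u)])
    minus = V([pi - h * ui for pi, ui in zip(p, u)])
    return [(a - b) / (2.0 * h) for a, b in zip(plus, minus)]

def bracket(X, Y, p):
    """Lie bracket [X, Y]: derivative of Y along X minus derivative of X along Y."""
    dY = directional(Y, p, X(p))
    dX = directional(X, p, Y(p))
    return [a - b for a, b in zip(dY, dX)]

def omega(p, v):
    """The 1-form dy - tan(theta) dx applied to the vector v at p."""
    return v[1] - math.tan(p[3]) * v[0]

p = [0.0, 0.0, 0.0, 0.7]                  # arbitrary sample point
b = bracket(field_x, field_theta, p)      # approximately -sec^2(theta) d/dy
print(b, omega(p, b))
```

Both fields lie in the kernel of the 1-form, but their bracket does not, which is exactly the obstruction described above.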

Thus, although we have a condition that must be satisfied by tangent vectors
to a solution curve, we can't select a 3-dimensional configuration space on which
the solution curves must lie. We can only say the following:

    We must have $\langle F - mc'', v \rangle = 0$
    for all $v \in \ker(dx - R\cos\theta\, d\phi) \cap \ker(dy - R\sin\theta\, d\phi)$.

Here $F$ is evaluated at $(c(t), t)$ and $c''$ is evaluated at $t$, while $dx - R\cos\theta\, d\phi$
and $dy - R\sin\theta\, d\phi$ are evaluated at $c(t)$.
More generally, if the conditions in (1') are replaced by the vanishing of certain
1-forms $\omega_1, \dots, \omega_L$, then

    We must have $\langle F - mc'', v \rangle = 0$
    for all $v \in \ker\omega_1 \cap \cdots \cap \ker\omega_L$.

In terms of the linear functional

$$\Phi(v) = \langle F - mc'', v \rangle$$

this condition says that

$$\ker\Phi \supset \ker\omega_1 \cap \cdots \cap \ker\omega_L.$$

We can now appeal to the very same vector space fact that was used in our proof of
the Principle of Virtual Work, in Lecture 5, and conclude, applying the
argument at each point, that

$$\Phi = \lambda_1\omega_1 + \cdots + \lambda_L\omega_L$$

for some functions $\lambda_1, \dots, \lambda_L$, known as Lagrange multipliers. This leads us to the
following criterion for solutions:

    d'Alembert's Principle for Differential Constraints: If the constraints
    on a system require the tangent vector of the motion to lie in the
    subspace $\ker(\omega_1) \cap \cdots \cap \ker(\omega_L)$, then there are Lagrange
    multipliers $\lambda_1, \dots, \lambda_L$ such that the motions of the system under the external
    forces $F$ satisfy

    $$\langle F - mc'', v \rangle = \lambda_1\omega_1(v) + \cdots + \lambda_L\omega_L(v)$$

    for all tangent vectors $v$ at $c$.

To apply this to the problem of an upright disc rolling on a plane, where we
have the relations

$$\begin{aligned}
x' &= R\phi'\cos\theta \\
y' &= R\phi'\sin\theta
\end{aligned} \tag{1}$$

and

$$\begin{aligned}
0 &= dx - R\cos\theta\, d\phi \\
0 &= dy - R\sin\theta\, d\phi,
\end{aligned} \tag{1'}$$

we note that, as in the case of the rolling wheel, we are not dealing with a
single particle $c(t)$; in the present situation we have to think of the disc as three
different "particles", the particle $(x, y)$ with mass $M$, the particle $\phi$ with mass $I$,
the moment of inertia of the disc about the axle, and the particle $\theta$ with mass $I_\theta$,
the moment of inertia of the disc about a diameter. We thus have

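$$\begin{aligned}
\langle (-Mx'', -My'', -I_\theta\theta'', -I\phi''),\ v \rangle
= \lambda_1 (dx - R\cos\theta\, d\phi)(v) + \lambda_2 (dy - R\sin\theta\, d\phi)(v) \quad \text{for all } v.
\end{aligned} \tag{$*$}$$

Taking $v = \partial/\partial x$, $\partial/\partial y$, $\partial/\partial\theta$, and $\partial/\partial\phi$, this gives us the equations

    (2x)   $-Mx'' = \lambda_1$
    (2y)   $-My'' = \lambda_2$
    (2θ)   $I_\theta\theta'' = 0$
    (2φ)   $I\phi'' = \lambda_1 R\cos\theta + \lambda_2 R\sin\theta.$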



Differentiating our original constraint equations (1) gives

$$\begin{aligned}
x'' &= R\phi''\cos\theta - R\phi'\theta'\sin\theta \\
y'' &= R\phi''\sin\theta + R\phi'\theta'\cos\theta,
\end{aligned}$$

so substituting (2x) and (2y) into (2φ) gives

$$\begin{aligned}
I\phi'' &= -MR\bigl[(R\phi''\cos\theta - R\phi'\theta'\sin\theta)\cos\theta
          + (R\phi''\sin\theta + R\phi'\theta'\cos\theta)\sin\theta\bigr] \\
        &= -MR^2\phi''.
\end{aligned} \tag{3}$$

Thus $(I + MR^2)\phi'' = 0$, and $\phi'$ is constant.

Equation (2θ) shows that $\theta'$ is also constant, and if we substitute the two
expressions

$$\begin{aligned}
\phi(t) &= at + b \\
\theta(t) &= ct + d
\end{aligned}$$

into (1) and solve, we find that $(x, y)$ moves along a circle of radius $Ra/c$ for
$c \ne 0$, or a straight line if $c = 0$, in which case $\theta$ is constant.
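
As a sanity check on the circle of radius $Ra/c$, here is a small sketch that integrates equations (1) numerically for $\phi' = a$ and $\theta(t) = ct + d$. Everything specific in it is an assumption made for illustration: the function name, the midpoint rule, the starting point $(0, 0)$, and the sample constants.

```python
import math

def disc_center_path(R, a, c, d, t_end, steps=20000):
    """Integrate x' = R a cos(ct + d), y' = R a sin(ct + d) from (0, 0)
    by the midpoint rule and return the list of points visited."""
    dt = t_end / steps
    x = y = 0.0
    pts = [(x, y)]
    for k in range(steps):
        theta = c * (k + 0.5) * dt + d     # theta at the midpoint of the step
        x += R * a * math.cos(theta) * dt
        y += R * a * math.sin(theta) * dt
        pts.append((x, y))
    return pts

# Illustrative constants (assumed); one full revolution of theta.
R, a, c, d = 0.5, 3.0, 1.5, 0.2
pts = disc_center_path(R, a, c, d, t_end=2.0 * math.pi / c)

# Center of the circle obtained by integrating (1) exactly from (0, 0).
cx, cy = -(R * a / c) * math.sin(d), (R * a / c) * math.cos(d)
max_dev = max(abs(math.hypot(x - cx, y - cy) - R * a / c) for x, y in pts)
print(max_dev)
```

The deviation from the radius $Ra/c$ is at the level of the integration error, and the path closes up after one revolution of $\theta$.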

We can apply the same analysis to the more interesting case where our disc
is rolling down an inclined plane with slope $\alpha$. The only change to (*) is that
the term $-Mx''$ must be replaced with $Mg\sin\alpha - Mx''$. In the set of
equations (2), the only change is that equation (2x) is replaced by

$$gM\sin\alpha - Mx'' = \lambda_1.$$

Proceeding as before, (3) then becomes

$$\begin{aligned}
(I + MR^2)\phi''(t) &= (MgR\sin\alpha)\cos\theta(t) \\
                    &= (MgR\sin\alpha)\cos(ct + d),
\end{aligned}$$

which, introducing an appropriate constant $A$, we write simply as

$$\phi''(t) = A\cos(ct + d).$$

In the special case $c = 0$, the angle $\theta$ will be constant. This case is essentially
just the same as the case of a wheel rolling down an inclined plane: the disc
rolls down the inclined plane along the straight line that makes a constant angle
with the $x$-axis.




For $c \ne 0$, we might as well take $d = 0$, since this just amounts to changing
the point from which $\phi$ is measured, so there are constants $B$ and $C$ with

$$\phi'(t) = B\sin(ct) + C.$$

In the special case where $C = 0$, equation (1) gives, for a constant $D$,

$$\begin{aligned}
x'(t) &= D\sin(ct)\cos(ct) = \tfrac{1}{2}D\sin(2ct) \\
y'(t) &= D\sin^2(ct) = \tfrac{1}{2}D(1 - \cos(2ct)).
\end{aligned}$$

Thus $x$ is basically sinusoidal, while $y$ is a sinusoidal motion combined with
a uniform drift, coming from the constant term $\tfrac{1}{2}D$ in $y'$. Somewhat like a pendulum bob, the disc is continually
interchanging potential energy and kinetic energy. At the lowest points of the
trajectory, where the kinetic energy is greatest, the disc is moving parallel to
the $y$-axis, where it has just enough kinetic energy to propel it back up to the
highest level.

More generally, we have constants $E$ and $F$ with

$$\begin{aligned}
x' &= E\sin(2ct) + F\cos(ct) \\
y' &= E(1 - \cos(2ct)) + F\sin(ct),
\end{aligned}$$

and if you have a nice computer program for producing graphs you can find
out what the most general solution looks like.
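
For readers who want to produce such graphs, the following sketch tabulates the trajectory by integrating the two expressions for $x'$ and $y'$ in closed form. The function names, the initial condition $x(0) = y(0) = 0$, and the sample constants are assumptions chosen for illustration; the resulting list of points can be handed to whatever plotting tool is at hand.

```python
import math

def velocity(E, F, c, t):
    """(x', y') from the general solution above."""
    return (E * math.sin(2 * c * t) + F * math.cos(c * t),
            E * (1 - math.cos(2 * c * t)) + F * math.sin(c * t))

def inclined_disc_path(E, F, c, t_end, n=400):
    """Sample (t, x(t), y(t)) with x(0) = y(0) = 0, for c != 0, using the
    antiderivatives of the expressions for x' and y'."""
    pts = []
    for k in range(n + 1):
        t = t_end * k / n
        x = E * (1 - math.cos(2 * c * t)) / (2 * c) + F * math.sin(c * t) / c
        y = E * (t - math.sin(2 * c * t) / (2 * c)) + F * (1 - math.cos(c * t)) / c
        pts.append((t, x, y))
    return pts

# Illustrative constants (assumed).
E, F, c = 1.0, 0.3, 2.0
path = inclined_disc_path(E, F, c, t_end=5.0, n=5000)
for t, x, y in path[::1000]:
    print(f"t={t:.2f}  x={x:.4f}  y={y:.4f}")
```

Setting `F = 0` reproduces the special case $C = 0$ discussed above, where $x$ oscillates and $y$ drifts steadily.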

The Lagrange multipliers $\lambda_i$ that occur in d'Alembert's principle for
differential constraints may seem to have appeared out of the blue, but they can be
interpreted in terms of the constraint forces $C$ on our system $S$. In fact, consider
two systems:

(a) the system $S$ with constraints $C$ and external forces $F$,

(b) the system $S$ with no constraints and external forces $F + C$.

The systems (a) and (b) obviously have the same solutions. But the solutions
to (a) satisfy

$$\langle F - mc'', v \rangle = \lambda_1\omega_1(v) + \cdots + \lambda_L\omega_L(v) \quad \text{for all } v,$$

while the solutions to (b) satisfy

$$\langle F + C - mc'', v \rangle = 0 \quad \text{for all } v;$$

subtracting the first equation from the second, we find that we must have

$$\langle C, v \rangle = -(\lambda_1\omega_1 + \cdots + \lambda_L\omega_L)(v) \quad \text{for all } v.$$

By writing out $\lambda_1\omega_1 + \cdots + \lambda_L\omega_L$ in terms of the coordinates, we can then find
all the components of $C$.

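For example, in our original problem of the disc rolling on a horizontal plane,
where we have

$$\begin{aligned}
\lambda_1\omega_1 + \lambda_2\omega_2 &= \lambda_1 (dx - R\cos\theta\, d\phi) + \lambda_2 (dy - R\sin\theta\, d\phi) \\
&= \lambda_1\, dx + \lambda_2\, dy - (\lambda_1 R\cos\theta + \lambda_2 R\sin\theta)\, d\phi,
\end{aligned}$$

we find that the components of $C$ are given by

$$\begin{aligned}
C_x &= \langle C, \partial/\partial x \rangle = -\lambda_1 \\
C_y &= \langle C, \partial/\partial y \rangle = -\lambda_2 \\
C_\theta &= \langle C, \partial/\partial\theta \rangle = 0 \\
C_\phi &= \langle C, \partial/\partial\phi \rangle = R(\lambda_1\cos\theta + \lambda_2\sin\theta).
\end{aligned}$$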
Thus, the constraint forces can be found in terms of $\lambda_1$ and $\lambda_2$, which we
can determine from (2x) and (2y) once we've solved explicitly for $x(t)$ and
$y(t)$. The $x$ and $y$ components together, the vector $M(x''(t), y''(t))$, represent
the constraint force on our "particle" $(x(t), y(t))$, and thus the frictional force
exerted by the plane to keep the center of mass in its circular orbit. Since the
center of mass moves in a circle with constant angular velocity, $(x''(t), y''(t))$
is always perpendicular to the velocity vector $v = (x'(t), y'(t))$ of the center of
mass, as we would expect.

The $\phi$ component of the constraints,

$$-MR(x''\cos\theta + y''\sin\theta),$$

represents the constraint force on our "particle" $\phi$, the frictional force exerted
by the plane against the direction of $\phi$; it is the additional frictional force, in
the direction of the velocity vector, that is needed to ensure that the wheel rolls.

In the case of holonomic constraints, we didn't need to use the Lagrange
multipliers $\lambda_i$, but we can use them, if we want to obtain the constraint forces. For
example, consider the rolling wheel problem, which we treated by the method
of configuration spaces on page 83. Now we will simply use the coordinates $s$
and $\phi$ and the relation

$$s'(t) = R\phi'(t)$$

between their derivatives. We then have the following condition for all $v$:

$$\langle (Mg\sin\alpha - Ms'',\ -I\phi''),\ v \rangle = \lambda (ds - R\, d\phi)(v).$$

Taking $v = \partial/\partial s$ and then $v = \partial/\partial\phi$ we get

    (a)   $Mg\sin\alpha - Ms'' = \lambda$
    (b)   $-I\phi'' = -R\lambda,$

so

$$Mg\sin\alpha - Ms'' = \frac{I}{R}\phi''$$

and differentiating the constraint $s' = R\phi'$ gives $s'' = R\phi''$, so this becomes

$$Mg\sin\alpha - Ms'' = \frac{I}{R^2}s'',$$

with the same solution

$$s'' = g\sin\alpha \cdot \frac{1}{1 + \dfrac{I}{R^2 M}}$$

as before. Substituting back into (a) then gives

$$\lambda = Mg\sin\alpha \cdot \frac{I}{R^2 M + I},$$

which agrees with the formula for the frictional force $f$ on page 82.
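
The algebra leading from (a) and (b) to $\lambda$ can be checked with a few lines of arithmetic. This is only a sketch with assumed names and sample values: it computes $s''$ from the solution above, obtains $\lambda$ from (a), and then confirms both equation (b) and the closed form for $\lambda$.

```python
import math

def wheel_multiplier(g, alpha, M, R, I):
    """Return (s'', lam) for the rolling wheel, where s'' solves
    M g sin(alpha) - M s'' = (I / R^2) s'' and lam comes from equation (a)."""
    s_dd = g * math.sin(alpha) / (1.0 + I / (R**2 * M))
    lam = M * g * math.sin(alpha) - M * s_dd
    return s_dd, lam

# Illustrative values (assumed); I is that of a homogeneous disc.
g, alpha, M, R = 9.81, math.radians(20.0), 3.0, 0.4
I = 0.5 * M * R**2

s_dd, lam = wheel_multiplier(g, alpha, M, R, I)
phi_dd = s_dd / R                                   # from s'' = R phi''
residual_b = I * phi_dd - R * lam                   # equation (b): I phi'' = R lam
closed_form = M * g * math.sin(alpha) * I / (R**2 * M + I)
print(lam, closed_form, residual_b)
```

For the disc the multiplier comes out as one third of $Mg\sin\alpha$, since $I/(R^2M + I) = 1/3$ when $I = \tfrac{1}{2}MR^2$.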

Of course, we can easily formulate and use a "mixed" version of d'Alembert's
principle, where some of the constraints restrict our system to lie in a
configuration space $M$, while other constraints restrict the tangent vector of the system
to be in the kernels of various 1-forms.
STATICALLY
INDETERMINATE STRUCTURES



Although we've naturally emphasized constraint problems that have solutions 
using only our basic consideration of rigid bodies, it is easy to pose problems of 
quite another sort. 

For example, suppose that we have a rigid plank of weight $W$ resting on three
identical rigid supports. What are the upward forces exerted by these supports
on the plank?

It is easy to see that there simply is no well-determined answer to this question. 
For example, one solution is that each support provides an upward force of 
W/3, while another possible solution is obtained by simply ignoring the middle 
support, and thus assigning an upward force of W/2 to each of the end supports. 

Of course, in reality the solution is not indeterminate: one can actually
measure the upward forces and obtain a specific answer; and, in reality, the plank
actually isn't rigid, but bends a bit. For some reason such problems never seem
to be addressed in physics courses nowadays, even though they would seem to
be wonderful examples of actually applying the basic laws of physics. They
are apparently relegated to courses on "material science" or "mechanical
engineering", and I only found a simple discussion in the delightfully old-fashioned
book [S-G].

Before tackling this problem, let's look at a simpler one, just to see how
some simple physics can be applied to problems of this nature. Consider three
filaments, of the same material, arranged as in (a), so that $ABD$ is an isosceles
triangle, and $CD$ is its altitude. A weight $W$ is hung from the end, as in (b),
so that the side filaments exert a force of magnitude $P$ along their directions,
while the middle filament exerts a force of magnitude $Q$. These forces actually
come about because the filaments stretch slightly as in (c).

[Figure: panels (a), (b), (c) showing the filaments $AD$, $CD$, $BD$ attached at $A$, $C$, $B$, with the end $D$ displaced to $D'$.]

We assume that the filaments obey Hooke's law: If the filament is fixed at one
end and the force $F$ pulls on the other end, then

$$|F| = \lambda \cdot \frac{\Delta l}{l} \quad \text{for some constant } \lambda,$$

where $l$ is the length of the unstretched filament, and $\Delta l$ is the increase in length
(this holds only for a certain interval of $\Delta l$ values).

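We could use geometry to find the increase in length $AD' - AD$ of the left filament in
terms of the change in length $\delta = DD'$, express both $P$ and $Q$ in terms of $\delta$
and the constant $\lambda$, and then use the vertical balance of forces,
$2P\sin\alpha + Q = W$, where $\alpha$ is the angle that the side filaments make with the
line $AB$, to find $\delta$, leading to rather messy formulas for $P$ and $Q$, in terms of $\lambda$.
But it is much easier to see what the limiting values are for $\delta \to 0$.

Suppose we choose the point $E$ on $AD'$ so that $AE = AD$. By Hooke's law,
$P = \lambda \cdot ED'/AD$ and $Q = \lambda \cdot DD'/CD$, so

$$\begin{aligned}
\frac{P}{Q} &\approx \frac{ED'}{AD'} \cdot \frac{CD'}{DD'} \\
            &= \frac{ED'}{DD'} \cdot \frac{CD'}{AD'} \\
            &= \frac{ED'}{DD'} \cdot \sin\alpha.
\end{aligned}$$

But the angle $DED'$ is $\approx \pi/2$, so the first ratio is $ED'/DD' \approx \sin\alpha$, and we
conclude that

$$\frac{P}{Q} \approx \sin^2\alpha,$$

and using $2P\sin\alpha + Q = W$ we can then solve for $P$ and $Q$.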
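
To make the last step concrete, here is a short sketch that solves for $P$ and $Q$ from the limiting ratio $P/Q \approx \sin^2\alpha$. It rests on an assumption that should be stated loudly: the force balance is taken to be the vertical one, $2P\sin\alpha + Q = W$, with $\alpha$ the angle between a side filament and the line $AB$ (so that $\sin\alpha = CD/AD$). The function name and the sample values are hypothetical.

```python
import math

def filament_forces(W, alpha):
    """Limiting forces (P, Q) for small stretching, assuming
    P / Q = sin^2(alpha) and the vertical balance 2 P sin(alpha) + Q = W."""
    s = math.sin(alpha)
    Q = W / (1.0 + 2.0 * s**3)   # substitute P = Q s^2 into the balance
    P = Q * s**2
    return P, Q

W = 10.0                          # illustrative weight
P, Q = filament_forces(W, math.radians(60.0))
P_vert, Q_vert = filament_forces(W, math.pi / 2)   # all three filaments vertical
print(P, Q, P_vert, Q_vert)
```

In the degenerate case $\alpha = \pi/2$ the three filaments are parallel and of equal length, and the sketch returns $W/3$ for each, as one would expect.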

Our original plank problem is similar, but rather more complicated. If the
plank were perfectly rigid, the supports would all have to be compressed by the
same amount, and thus all have to provide an upward force exactly 1/3 of the
weight of the plank. But we are interested instead in analyzing the effect of the
plank's bending, which results in different compressions of the supports, and
thus different upward forces. In fact, we aren't interested in the extremely tiny
compression of the supports at all, only in the upward forces that they will have
to provide to balance the bent plank. For simplicity we consider "knife edge"
supports, which touch the plank along a line, appearing as a single point in our
2-dimensional section.

[Figure: 2-dimensional section of the plank resting on three knife edge supports.]

Hooke's law also holds for a plank or rod, except that it is stated a bit
differently, because we want to take into account the cross-section $A$, which was
essentially assumed to be 0 for the case of a filament.

[Figure: rod of length $l$ and cross-sectional area $A$, stretched by $\Delta l$ under the force $F$.]

So we consider the ratios

$$\text{stress } \sigma = \frac{|F|}{A}, \qquad \text{strain } \varepsilon = \frac{\Delta l}{l},$$

and write Hooke's law in the form

$$\sigma = E\varepsilon$$

for a constant $E$, the modulus of elasticity. Of course, $A$ actually changes a bit
when the force $F$ is applied, but the change is so minute that it is disregarded.
It should also be noted that the modulus of elasticity for stretching might not
be the same as that for compression (concrete is supposed to be an example),
but we will only consider the case where they are the same.

For steel, $E \approx 29 \times 10^6$ psi (pounds per square inch), while for molybdenum,
$E \approx 49 \times 10^6$ psi. For wood $E$ varies between $\approx 0.6 \times 10^6$ psi and $\approx 1.7 \times 10^6$ psi,
depending on the type and grade of wood, the direction of the load, etc.

Now consider a long plank supported by various knife edge supports. The
plank is actually going to sag a small amount, so that viewing it head on we
see something like the picture shown below. To the left of the dotted line, there
is a compressing stress along the top and a stretching stress along the bottom,
while to the right of the dotted line just the opposite is true. The stress is thus 0
at some intermediate surface, the neutral plane, whose profile, shown as a heavy
line in the figure, is the graph of some function $f$. The figure also shows a cross
section of the "fibre" through $(x, f(x))$, that is, the surface into which a vertical
section of the plank is deformed.

The figure below is a greatly enlarged view of a small portion of the figure
near the point $(x, f(x))$, bounded by two fibres. While the heavy line is the
graph of $f$, the cross-section of the neutral plane, the dotted line indicates the
cross-section of a surface where the stress has the constant value $y$. The whole
region is filled up by such surfaces having values in some interval $(y_0, y_1)$.

Let the curvature of the graph of $f$ at the point $(x, f(x))$ be $\kappa(x)$, where we
have

$$\kappa(x) = \frac{f''(x)}{\bigl[1 + (f'(x))^2\bigr]^{3/2}}.$$

Near the point $(x, f(x))$, the graph is very close to a segment of a circle
subtending an angle $\phi$ with radius $R = 1/(-\kappa)$, where the minus sign is necessary
because $f'' < 0$ at this point. So the solid line and the dotted line have lengths $\lambda$
and $\lambda'$ given, to first order, by

$$\begin{aligned}
\lambda &= R\phi \\
\lambda' &= (R + y)\phi.
\end{aligned}$$

Consequently, the strain $\varepsilon$ along the surface indicated by the dotted line is given
by

$$\varepsilon = \frac{\lambda' - \lambda}{\lambda} = \frac{y\phi}{R\phi} = \frac{y}{R} = -y\kappa,$$

and the stress along this surface is

$$\sigma = \varepsilon E = -y\kappa E.$$

It is easy to check that for points where $f'' > 0$ we get exactly the same formula.

For the fibre $A$ through the point $(x, f(x))$, let $\tau(x)$ be the total torsion on $A$,
with respect to the point $(x, f(x))$, from all the external forces to the left of the
point (gravity acting down on the portion of the board to the left, together with
the upward force of any supports to the left). Since $A$ isn't rotating, we must
have the following, where $b$ is the width of the plank:

$$\begin{aligned}
\tau(x) &= \int_A \sigma \cdot y \\
        &= b\int_{y_0}^{y_1} \sigma(x, y) \cdot y\, dy \\
        &= -E\kappa(x)\int_{y_0}^{y_1} b y^2\, dy \\
        &= -EI\kappa(x),
\end{aligned}$$

where

$$I = \int_{y_0}^{y_1} b y^2\, dy$$

is, by definition, the moment of inertia of $A$, which we will assume can be taken
to be a constant.

This gives us the equation $\tau(x) = -EI\kappa(x)$. Finally, since $f'$ is usually going
to be extremely small, we simply throw away the $f'(x)$ term in the expression
for $\kappa(x)$, leading to the Euler-Bernoulli equation for plank bending,

$$-EIf''(x) = \tau(x). \tag{$*$}$$

As a simple application of the Euler-Bernoulli equation, consider a plank of
length $a$ resting on three knife blade supports, two at the ends, and one in the
middle. The plank, of weight $W$, is assumed to have uniform density $w = W/a$;
the outside supports each exert an upward force of $P$ and the middle support
exerts an upward force of $Q$, with $2P + Q = W$. For convenience, we choose
the position of the $x$-axis so that our function $f$ is 0 at $0$, $a/2$, and $a$.

[Figure: plank of length $a$ resting on supports at $0$, $a/2$, and $a$.]
For $0 < x < a/2$ we have

$$\tau(x) = -Px + \tfrac{1}{2}wx^2,$$

where the first term is the moment of the upward force $P$ at distance $x$ from our
point, and the second term comes from the uniformly distributed force of $w$
along the plank of length $x$ to the left of our point. Thus

$$EIf''(x) = Px - \tfrac{1}{2}wx^2$$

and

$$EIf'(x) = \frac{Px^2}{2} - \frac{wx^3}{6} + C_1.$$

There is another equation for $a/2 < x < a$, involving another constant $C_2$, but
in this case we can use symmetry to dispense with the second expression. Since
we clearly have $f'(a/2) = 0$, we can immediately solve for $C_1$, to get

$$EIf'(x) = \frac{Px^2}{2} - \frac{wx^3}{6} + \frac{wa^3}{48} - \frac{Pa^2}{8},$$

and since $f(0) = 0$ this gives

$$EIf(x) = \frac{Px^3}{6} - \frac{wx^4}{24} + \left(\frac{wa^3}{48} - \frac{Pa^2}{8}\right)x.$$

Finally, using $f(a/2) = 0$, and remembering that $aw = W$, this gives

$$P = \frac{3}{16}W.$$

So each end provides an upward force of $\tfrac{3}{16}W$, while the middle support bears
most of the weight, providing an upward force of $\tfrac{5}{8}W$.

I realized that I had the opportunity to end these lectures with a little
experiment, balancing those provided in the first lecture, when I recalled that I had
purchased a cute little kitchen scale at one of Tokyo's ubiquitous 100¥ shops.
Wandering through Shibuya until I found the store again, I was gratified to see
that they still had an ample supply, so for another couple hundred ¥, I was able
to set up a test of our result.
Supplying three cardboard pieces to provide knife edge supports above each
scale, I set a board upon them, and eagerly looked at the readings of the three scales,
confidently expecting the middle scale to register over 3 times the readings of
the end scales. But, as I discovered, and showed at the lecture, the end scales
actually registered much larger weights than the middle scale!

This, of course, was somewhat disconcerting! To be sure, a 100¥ scale
probably isn't all that accurate, but only Descartes would try to explain away such
a large discrepancy as experimental error!

I then reflected that this particular arrangement did not actually correspond
to the problem as originally posed, because we assumed that compressions of
the supports were of negligible magnitude, so that we had $f(0) = f(a/2)$. But
now the compression of the supports (the scales) is certainly significant, since
this compression is what causes the scale to register.

So we might ask what happens when the end scales are compressed down by
the amount $\delta_1$, while the middle scale is compressed down by the amount $\delta_2$.
We still have the equation

$$EIf'(x) = \frac{Px^2}{2} - \frac{wx^3}{6} + \frac{wa^3}{48} - \frac{Pa^2}{8},$$

since this depended only on the fact that $f'(a/2) = 0$. Using $f(0) = -\delta_1$ we
then have

$$EIf(x) = \frac{Px^3}{6} - \frac{wx^4}{24} + \left(\frac{wa^3}{48} - \frac{Pa^2}{8}\right)x - EI\delta_1,$$

so that $f(a/2) = -\delta_2$ then gives

$$EI(\delta_1 - \delta_2) = \frac{Pa^3}{48} - \frac{wa^4}{24 \cdot 16} + \left(\frac{wa^3}{48} - \frac{Pa^2}{8}\right)\frac{a}{2},$$

leading to

$$24EI(\delta_1 - \delta_2) = a^3\left(\frac{3}{16}W - P\right).$$
If we let $\lambda$ be the appropriate constant from Hooke's law for the identical
springs on each of the scales, so that $P = \lambda\delta_1$ and $Q = \lambda\delta_2$, this can be written as

$$\frac{24EI}{\lambda}(P - Q) = a^3\left(\frac{3}{16}W - P\right)$$

or, since $Q = W - 2P$,

$$\frac{24EI}{\lambda}(3P - W) = a^3\left(\frac{3}{16}W - P\right).$$

Of course, it's the ratio $P/W$ that interests us, so we write

$$\frac{24EI}{\lambda}\left(3\frac{P}{W} - 1\right) = a^3\left(\frac{3}{16} - \frac{P}{W}\right),$$

which we can solve as

$$\frac{P}{W} = \frac{\dfrac{3a^3}{16} + \dfrac{24EI}{\lambda}}{a^3 + \dfrac{72EI}{\lambda}}.$$

Now this result at least seems reasonable at the extreme values for $\lambda$: If $\lambda$
is very large (so that the springs on the scale are extremely strong and
consequently their compression is very tiny), then we get the answer

$$\frac{P}{W} \approx \frac{3}{16}$$

obtained previously, when we assumed no compression at all. On the other
hand, if $\lambda$ is very small, we get

$$\frac{P}{W} \approx \frac{1}{3},$$

i.e., the weight is distributed nearly equally among the supports.
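
The two limiting cases are easy to explore numerically. The following sketch evaluates the ratio $P/W = (3a^3/16 + 24EI/\lambda)/(a^3 + 72EI/\lambda)$ obtained by solving the relation above; the function name and the sample values of $a$, $EI$, and $\lambda$ are assumptions chosen only to exhibit the limits, not measured data.

```python
def end_fraction(a, EI, lam):
    """P/W for a plank of length a and stiffness EI on three identical
    spring supports with Hooke constant lam (force = lam * compression)."""
    k = 24.0 * EI / lam
    return (3.0 * a**3 / 16.0 + k) / (a**3 + 3.0 * k)

a, EI = 1.0, 1.0                          # illustrative units
stiff = end_fraction(a, EI, 1e12)         # very strong springs
soft = end_fraction(a, EI, 1e-12)         # very weak springs
middle = end_fraction(a, EI, 24.0)        # an intermediate case, k = 1
print(stiff, soft, middle)
```

Within this model the ratio always lies between $3/16$ and $1/3$, so the end supports never carry more than the middle one; the model therefore cannot by itself account for end scales reading higher than the middle scale.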

However, it seemed very unlikely that I would get an answer anywhere close
to the results of the experiment if I inserted the proper value of $\lambda$ (determined
easily enough by seeing how far down a scale was depressed when it registered
a particular weight), because even for a piece of wood, $E$ is enormous, on the
order of $10^6$, so the formula would just give $P/W \approx 1/3$.

So I ended this series of lectures confessing my inability to provide a good 
answer for this final problem. Perhaps some day I'll be able to persuade a 
mechanical engineer to sit down and explain to a mathematician just how this 
problem really should be handled. 



~ THE END ~ 